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(gj) Non-human Carbonyl hydrolase mutants, DNA sequences and vectors encoding same and hosts transformed with said 
vectors. 

© Novel carbonyl hydrolase mutants derived from the 
amino acid sequence of naturally-occurring or recombinant 
non-human carbonyl hydrolases and DNA sequences encod- 
ing the same. The mutant carbonyl hydrolases, in general, 
are obtained by in vitro modification of a precursor DNA 
sequence encoding the naturally-occurring or recombinant 
carbonyl hydrolase to encode the substitution, insertion or 
deletion of one or more amino acids in the amino acid sequ- 
ence of a precursor carbonyl hydrolase. Such mutants have 
one or more properties which are different than the same 
property of the precursor hydrolase. 
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NON-HUMAN CARBONYL HYDROLASE MUTANTS, 
DNA SEQUENCES AND VECTORS ENCODING SAME 
AND HOSTS TRANSFORMED WITH SAID VECTORS 

The recent development of various in vitro techniques 
to manipulate the DNA sequences encoding 
naturaily-occuring polypeptides as veil as recent 
developments in the chemical synthesis of relatively 
short sequences of single and double stranded DNA has 
resulted in the speculation that such techniques can 
be used to modify enzymes to improve some functional 
property in a predictable way. Ulmer, K.M. (1983) 
Science 219 , 666-671. The only working example 
disclosed therein is the substitution of a single 
amino acid within the active site of tyrosyl-tRNA 
synthetase (Cys35-Ser) which lead to a reduction in 
enzymatic activity. See Winter, G., et al. (1982) 
Nature 299 . 756-758; and Wilkinson, A.J., et al. 
(1983) Biochemistry £2, 3581-3586 (Cys35-Gly mutation 
also resulted in decreased activity). 

When the same t-RNA synthetase was modified . by 
substituting a different amino acid residue within the 
.active site with two different amino acids, one of the 
mutants (Thr51-Ala) reportedly demonstrated a 
predicted moderate increase in kcat/Km whereas a 
second mutant (ThrSl^Pro) demonstrated a massive 
increase in kcat/Xm which could not be explained with 
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certainty. Wilkinson, A.H., et al. (1984) Nature 307 , 
187-188. 

Another reported example of a single substitution of 
an amino acid residue is the substitution of cysteine 
for isoleucine at the third residue of T4 lysozyme. 
Perry, L.J., et al. (1984) Science 226 , 555-557. The 
resultant mutant lysozyme was mildly oxidized to form 
a disulfide bond between the new cysteine residue at 
position 3 and the native cysteine at position 97. 
This crosslinked mutant was initially described by the 
author as being enzymatically identical to, but more 
thermally stable than, the wild type enzyme. However, 
in a "Note Added in Proof", the author indicated that 
the enhanced stability observed was probably due to a 
chemical modification of cysteine at residue 54 since 
the mutant lysozyme with a free thiol at Cys54 has a 
thermal stability identical to the wild type lysozyme. 

Similarly, a modified dihydrofolate reductase from 
E.coli has been reported to be modified by similar 
methods to introduce a cysteine which could be 
crosslinked with a naturally-occurring cysteine in the 
reductase. Villafranca, D.E., et al. (1983) Science 
222, 782-788. The author indicates that this mutant 
is fully reactive in the reduced state but has 
significantly diminished activity in the oxidized 
state. In addition, two other substitutions of 
specific amino acid residues are reported which 
resulted in mutants which had diminished or no 
activity. 

EPO Publication No. 0130756 discloses the substitution 
of specific residues within B. amyloliquefaciens 
subtilisin with specific amino acids. Thus, Met222 
has been substituted with all 19 other amino acids, 
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Glyl66 with 9 different amino acids and Glyl69 with 
Ala and Ser. 

As set forth below, several laboratories have also 
reported the use of site directed mutagensis to. 
produce the mutation of more than one amino acid 
residue within a polypeptide. 

The amino-terminal region of the signal peptide of the 
prolipoprotein of the E. coli outer membrane was 
stated to be altered by the substitution or deletion 
of residues 2 and 3 to produce a charge change in that 
region of the polypeptide. Inoyye, S., et al. (1982) 
Proc. Nat. Acad, Sci. USA 79 , 3438-3441. The same 
laboratory also reported the substitution and deletion 
of amino acid redisues 9 and 14 to determine the 
effects of such substitution on the hydrophobic region 
of the same signal sequence, Inouye, S., et al. 
(1984) J, Biol, Chem. 259 , 3729-3733. 

Double mutants in the active site of tyrosyl-t-RNA 
synthetase have also been reported. Carter, P.J., et 
al. (1984) Cell 38, 835-840. In this report, the 
improved affinity of the previously described 
Thr51+Pro mutant for ATP was probed by producing a 
second mutation in the active site of the enzyme. One 
of the double mutants, Gly35/Pro51, reportedly 
demonstrated an unexpected result in that it bound ATP 
in the transition state better than was expected from 
the two single mutants. Moreover, the author warns, 
at least for one double mutant, that it is not readily 
predictable how one substitution alters the effect 
caused by the other substitution and that care must be 
taken in interpreting such substitutions. 
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A mutant is disclosed in U.S. Patent No. 4,532,207, 
wherein a polyarginine tail was attached to the 
C-terminal residue of 0-urogastrone by modifying the 
DNA sequence encoding the polypeptide. As disclosed, 
the polyarginine tail changed the electrophoretic 

5 mobility of the urogastrone-polyaginine hybrid 
permit ing selective purification. The polyarginine 
was subsequently removed, according to the .patentee, 
by a polyarginine specific exopeptidase to produce .the 
purified urogastrone. Properly construed, this 

10 reference discloses hybrid polypeptides which do not 
constitute mutant polypeptides containing the 
substitution, insertion or deletion of one or more 
amino acids of a naturally occurring polypeptide. 

15 Single and double mutants of rat pancreatic trypsin 
have also been reported. CraiJc, C.S., et al. (1985) 
Science 228 , 291-297. As reported, glycine residues 
at positions 216 and 226 were replaced with alanine 
residues to produce three trypsin mutants (two single 

2o mutants and one double mutant) . In the case of the 
single mutants, the authors stated expectation was to 
observe a differential effect on Km. They instead 
reported a change in specificity (kcat/Km) which was 
primarily the result of a decrease in kcat. In 

25 contrast, the double mutant reportedly demonstrated a 
differential increase in Km for lysyl and arginyl 
substrates as compared to wild type trypsin but had 
virtually no catalytic activity. 

3 q The references discussed above are provided solely for 
their disclosure prior to the filing date of the 
instant case, and nothing herein is to be construed as 
an admission that the inventors are not entitled to 
antedate such disclosure by virtue of prior invention 

35 or priority based on earlier filed applications. 
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Based on the above references, however, it is apparent 
that the modification of the amino acid sequence of 
wild type enzymes often results in the decrease or 
destruction of biological activity, 

5 Accordingly, it is an object herein to provide 
carbonyl hydrolase mutants which have at least one 
property which is different from the same property of 
the carbonyl hydrolase precursor from which the amino 
acid of said mutant is derived. 

10 

It is a further object to provide mutant DNA sequences 
encoding such carbonyl hydrolase mutants as well as 
expression vectors containing such mutant DNA 
sequences. 

15 

Still further, another object of the present invention 
is to provide host cells transformed with such vectors 
as well as host cells which are capable of expressing 
such mutants either intracellularly or 
20 extraceilularly. 
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g„,mn*r Y of the Inv ention 

The invention includes carbonyl hydrolase mutants, 
preferably having at least one property which, is, 
substantially different from the same property of the 
precursor non-human carbonyl hydrolase from which the 
5 amino acid sequence of the mutant is derived. These 
properties include oxidative stability, substrate, 
. specificity catalytic activity, thermal stability, 
alkaline stability, P H activity profile and resistance 
to proteolytic degradation. The precursor carbonyl 
10 hydrolase may be naturally occurring carbonyl 
hydrolases or recombinant carbonyl hydrolases. The 
amino acid sequence of the carbonyl hydrolase mutant 
is derived by the substitution, deletion or insertion 
of one or more amino acids of the. precursor carbonyl 
15 hydrolase amino acid sequence. 

The invention also includes mutant DNA sequences 
encoding such carbonyl hydrolase mutants. Further the 
invention includes expression vectors containing such 
mutant DNA sequences as well as host cells transformed 
with such vectors which are capable of expressing said 
carbonyl hydrolase mutants. 
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Brief Description of the Drawings 

Figure 1 shows the nucleotide sequence of the coding 
strand, correlated with the amino acid sequence of B. 
»» Y i nU guefaciens subtilisin gene. Promoter (p) 
ribosome binding site (rbs) and termination (term) 
regions of the DNA sequence as well as sequences 
encoding the presequence (PRE) putative prosequence 
(PRO) and mature form (MAT) of the hydrolase are also 
shown. 

Figure 2 is a schematic diagram showing the substrate 
binding cleft of subtilisin together with substrate. 
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Figure 3 is a stereo view of the S-l binding subsite 
of B. amvlolicruefaciens subtilisin showing a lysine 
P-l substrate bound in the site in two' different ways. 
Figure 3 A shows Lysine P-.l substrate bound to form a 
salt bridge with a Glu at position 156. Figure 3B 
shows Lysine P-l substrate bound to form a salt bridge 
with Glu at position 166. 

Figure 4 is a schematic diagram of the active site of 
subtilisin Asp32, His64 and Ser221. 

Figures 5A and 5B depict the amino acid sequence of 
subtilisin obtained from various sources. The 
residues directly beneath each residue of B. 
amyloliquefaciens subtilisin are equivalent residues 
which (1) can be mutated in a similar manner to that 
described for B. amyloliquefaciens subtilisin, or (2) 
can be used as a replacement amino acid residue in B. 
amyloliquefaciens subtilisin. Figure 5C depicts 
conserved residues of B. amyloliquefaciens subtilisin 
when compared to other subtilisin sequences. 

Figures 6A and 6B depict the inactivation of the 
mutants Met222L and Met222Q when exposed to various 
organic oxidants. 

Figure 7 depicts the ultraviolet spectrum of Met222F 
subtilisin and the difference spectrum generated after 
inactivation by diperdodecanoic acid (DPDA) . 

Figure 8 shows the pattern of cyanogen bromide digests 
of untreated and DPDA oxidized subtilisin Met222F on 
high resolution SDS-pyridine peptide gels. 
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Figure 9 depicts a map of the cyanogen bromide 
fragments of Fig. 8 and their alignment with the 
sequence of subtilisin Met222F. 

Figure 10 depicts the construction of mutations 
between codons 45 and 50 of £. amvloliguef aciens 
subtilisin. 

Figure 11 depicts the construction, of mutations 
between codons 122 and 127 of g. amvloliguef aciens 
subtilisin. 

Figure 12 depicts the effect of DPDA on the activity 
of subtilisin mutants at positions 50 and 124 in 
subtilisin Met222F. 

Figure 13 depicts the construction of mutations at 
codon 166 of g. amyloliquef aciens subtilisin. 

Figure 14 depicts the effect of hydrophobicity of the 
P-l substrate side-chain on the kinetic parameters of 
wild-type g. amyloliquef aciens subtilisin. 

Figure 15 depicts the effect of position 166 
side-chain substitutions on P-l substrate specificity. 
Figure 15A shows position 166 mutant subtil isins 
containing non-branched alkyl and aromatic side-chain 
substitutions arranged in order of increasing 
molecular volume. Figure 15B shows a series of mutant 
enzymes progressing through p- and 7-branched 
aliphatic side chain substitutions of increasing 
molecular volume. 

Figure 16 depicts the effect of position 166 
side-chain volumn on log kcat/Km for various P-l 
substrates. 
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Figure 17 shows the substrate specificity differences 
between Ilel66 and wild-type (Glyl66) B. 
amvloliquef aciens subtilisin against a series of 
alphatic and aromatic substrates. Each bar represents 
the . difference in log kcat/Km for . Ilel66 . minus 
wild-type (Glyl66) subtilisin. 

Figure 18 .depicts the construction of mutations at 
codon 169 of B. amyloliouef aciens subtilisin. 

Figure 19 depicts the construction of mutations at 
codon 104 of B. amyloliouef aciens subtilisin. 

Figure 20 depicts the construction of mutations at 
codon 152 fi. amvlolicruef aciens subtilisin. 

Figure 21 depicts the construction of single mutations 
at codon 156 and double mutations at codons 156 and 
166 of B. amvloliquef aciens subtilisin. 

Figure 22 depicts the construction of mutations at 
codon 217 for B. amvlolicruef aciens subtilisin. 

Figure 23 depicts the kcat/Km versus pH profile for 
mutations at codon 156 and 166 in B. amvloliquef aciens 
subtilisin. 

Figure 23A depicts the kcat/Km versus pH profile for 
mutations at codon 156 and 166 in B. amyloliquef aciens 
subtilisin. 

Figure 24 depicts the kcat/Km versus- pH profile for 
mutations at codon 222 in B. amvloliq uef aciens 
subtilisin. 
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various wild type and mutant b 
substrates . 

. x b C and D depict the effect of charge 
Figures 28 A f B ^ C substituti ons at codon 

in the P-l binding sites 
156 and 166. 

„ * a stereoview of the P-l binding 
Figures 29 A and B are a stere subst rate 
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„ depicts the strategy for producing point 
Figure 31 f^J^^ coding sequence by «i.xn- 
Butations in the subtilise triphosphates, 
corporation of "-thioldeoxynucleotide 

107V/213R at alkaline pH. 

. wild type and mutant m 
F50/V107/R213 at alkaline P H. 
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Figure 34 depicts the strategy for constructing 
plasmids containing random cassette mutagenesis over 
residues 197 through 228. 

Figure 35 depicts the oligodeoxynucleotides used for 
random cassette mutagenesis over residues 197 through 
228. 

Figure 36 depicts the construction of mutants at codon 
204. 

Figure 37 depicts the oligodeoxynucleotides used for 
synthesizing mutants at codon 204 k 

Detailed Description 

The inventors have discovered that various single and 
multiple in vitro mutations involving the 
substitution, deletion or insertion of one or more 
amino acids within a non-human carbonyl hydrolase 
amino acid sequence can confer advantageous properties 
to such mutants when compared to the non-mutated 
carbonyl hydrolase. 

Specifically, B. amvloliauefaciens subtilisin, an 
alkaline bacterial protease, has been mutated by 
modifying the DNA encoding the subtilisin to encode 
the substitution of one or more amino acids at various 
amino acid residues within the mature form of the 
subtilisin molecule. These in vitro mutant 
subtilisins have at least one , property which is 
different when compared to the same property of the 
precursor subtilisin. These modified properties fall 
into several categories including: oxidative 
stability, substrate specificity, thermal stability, 
alkaline stability, catalytic activity, pH activity 
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profile, resistance to proteolytic degradation, Km, 
kcat and Km/kcat ratio. 

Carbonyl hydrolases are enzymes which hydrolyze 

0 

II 

compounds containing C-X bonds in which X is oxygen or 
nitrogen. They include naturally-occurring carbonyl 
hydrolases and recombinant carbonyl hydrolases. 
Naturally occurring carbonyl hydrolases principally 
include hydrolases, e.g. lipases and peptide 
hydrolases, e.g. subtilisins or metalloproteases. 
Peptide hydrolases include a-aminoacylpeptide 
hydrolase, peptidylamino-acid hydrolase, acylamino 
hydrolase, serine carboxypeptidase, metallocarboxy- 
peptidase, thiol proteinase, carboxylproteinase and 
metalloproteinase. Serine, metallo, thiol and acid 
proteases are included, as well as endo and exo- 
proteases. 

"Recombinant carbonyl hydrolase" refers to a carbonyl 
hydrolase in ' which the DNA sequence ' encoding the 
naturally occurring carbonyl hydrolase is modified to 
produce a mutant DNA sequence which encodes the 
substitution, insertion or deletion of one or more 
amino acids in the carbonyl hydrolase amino acid 
sequence. Suitable modification methods are disclosed 
herein and in EPO Publication No. 0130756 published 
January 3, 1935. 

- Subtilisins are bacterial carbonyl hydrolases which 
generally act to cleave peptide bonds of proteins or 
peptides. As used herein, "subtilisin" means a 
naturally occurring subtilisin or a recombinant 
subtilisin. A series of naturally 'occurring 
subtilisins is known to be produced and often secreted 



by various bacterial species. Amino acid sequences of 
the members of this series are not entirely 
homologous. However, the subtilisins in this series 
exhibit the same or similar type of proteolytic 
activity. This class of serine proteases shares a 
common amino acid sequence defining a catalytic triad 
which distinguishes them from the chymotrypsin related 
class of serine proteases. The subtilisins and 
chymotrypsin related serine proteases both have a 
catalytic triad comprising aspartate, histidine and 
serine. In the subtilisin related proteases the 
relative order of these amino acids, reading from the 
amino to carboxy terminus is aspartate-histidine- 
serine. In the chymotrypsin related proteases tne 
relative order, however is histidine-aspartate-serine. 
Thus, subtilisin herein refers to a serine protease 
having the catalytic triad of subtilisin related 
proteases. 

"Recombinant subtilisin" refers to a subtilisin in 
which the DNA sequence encoding the subtilisin is 
modified to produce a mutant DNA sequence which 
encodes the substitution, deletion or insertion of one 
or more amino acids in the naturally occurring 
subtilisin amino acid sequence. Suitable methods to 
produce such modification include those disclosed 
herein and in EPO Publication No. 0130756. For 
example, the subtilisin multiple mutant herein 
containing the substitution of methionine at amino 
acid residues 50, 124 and 222 with phenylalanine, 
isoleucine and glutamine, respectively, can be 
considered to be derived from the recombinant 
subtilisin containing the substitution of glutamine at 
residue 222 (Q222) disclosed in EPO Publication No. 
0130756. The multiple mutant thus is produced by the 
substitution of phenylalanine for methionine at 
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residue 50 and isoleucine for methionine at residue 
124 in the Q222 recombinant subtilisin. 

"Carbonyl hydrolases" and their genes may be obtained 
fr om many procaryotic and eucaryotic 

5 suitable examples of procaryotic organisms include 
gra m negative organisms such as E. cpli or pseudomonas 
L gram positive bacteria such as micrococcus or 
bacillus. Examples of eucaryotic organisms fromwhich 
carbonyl hydrolase and their genes may be obtained 

10 include yeast such as S. c^evis^, fungi such as 
Aspergillus sp., and non-human mammalian sources such 
as! for example, Bovine s P . from vhich the gene 
encoding the carbonyl hydrolase chymosin can be 
obtained. As with subtilisins, a series of carbonyl 

15 hydrolases can be obtained from various related 
species which have amino acid sequences vhich are not 
entirely homologous between the members of that series 
but which nevertheless exhibit the same or similar 
type of biological activity. Thus, non-human carbonyl 

20 hfarolase as used herein has a functional 

which refers to carbonyl hydrolases which are 
associated, directly or indirectly, with procaryotic 
and non-human eucaryotic sources. 
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A -carbonyl hydrolase mutant" has an amino acid 
sequence which is derived from the amino acid sequence 
of a non-human "precursor carbonyl hydrolase". The 
precursor carbonyl hydrolases include naturally- 
occurring carbonyl hydrolases and recombinant carbonyl 
hydrolases. The amino acid sequence of the carbonyl 
hydrolase mutant is "derived" from the £™ 
hydrolase amino acid sequence by the **>^"' 
deletion or insertion of one or more amino ^ acids of 
the precursor amino acid sequence, such modification 
is of the "precursor DNA sequence" which encodes the 
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amino acid sequence of the precursor carbonyl 
hydrolase rathern than manipulation of the precursor 
carbonyl hydrolase per se . Suitable methods for such 
manipulation of the precursor DNA sequence include 
methods disclosed herein and in EPO Publication No. 
0130756. , 

Specific residues of B. amvlolicruef aciens subtilisin 
are identified for substitution, insertion or 
deletion. These amino acid position numbers refer to 
those assigned to the B. amvlolicruef aciens subtilisin 
sequence presented in Fig. 1. The invention, however, 
is not limited to the mutation of this particular 
subtilisin but extends to precursor carbonyl 
hydrolases containing amino acid residues which are 
"equivalent" to the particular identified residues in 
B. amvlolicruef aciens subtilisin. 

A residue (amino acid) of a precursor carbonyl 
hydrolase is equivalent to a residue of B. 
amvlolicruef aciens subtilisin if it is either 
homologous (i.e., corresponding in position in either 
primary or . tertiary structure) or analagous to a 
specific residue or portion of that residue in B. 
amvloliauef aciens subtilisin (i.e., having the same or 
similar functional capacity to combine, react, or 
interact chemically) . 

In order to establish homology to primary structure, 
the amino acid sequence of a precursor carbonyl 
hydrolase is directly comparted to the B. 
amvloliguef aciens subtilisin primary sequence and 
particularly to a set of residues known to be 
invariant in all subtilisins for which sequence is 
known (Figure 5C) . After aligning the conserved 
residues, allowing for necessary insertions and 
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deletions in order to maintain alignment (i.e., 
avoiding the elimination of conserved residues through 
arbitrary deletion and insertion) , the residues 
equivalent to particular amino acids in the primary 
sequence of B. amvloliguefaciens subtilisin are 
defined. Alignment of conserved residues preferably 
should conserve 100% of such residues. However, 
alignment of greater than 75% or as little as 50% of 
conserved residues is also adequate to define 
equivalent residues. Conservation of the catalytic 
triad, Asp32/His64/Ser221 should be maintained. 

For example, in- Figure 5A the amino acid sequence of 
subtilisin from B. amyloiicruef aciens £. subtilisin 
var. 1168 and fi. lichenformis (carlsbergensis) are 
aligned to provide the maximum amount of homology 
between amino acid sequences. A comparison of these 
sequences shows that there are a number of conserved 
residues contained in each sequence. These residues 
are identified in Fig. 5C. 

These conserved residues thus may be used to define 
the corresponding equivalent amino acid residues of B. 
amyloliouef aciens subtilisin in other carbonyl 
hydrolases such as thermitase derived from 
Thermoactinomyces. These two particular sequences are 
aligned in Fig. 5B to produce the maximum homology of 
conserved residues. As can be seen there are a number 
of insertions and deletions in the thermitase sequence 
as compared to £. amyloiicruef aciens subtilisin. Thus, 
in thermitase the equivalent amino acid of Tyr217 in 
fi. amvloliguefaciens subtilisin is the particular 
lysine shown beneath Tyr217. 



In Fig. 5A, the equivalent amino acid at position 217 
in £. amvloliguefaciens subtilisin is Tyr. Likewise, 
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in B. subtilis subtilisin position 217 is also 
occupied by Tyr but in g. licheniformis position 217 
is occupied by Leu. 

Thus, these particular residues in thermitase, and 
.subtilisin from B. subtilisin and B. licheniformis may 
be substituted by a different amino acid to produce a 
mutant carbonyl hydrolase since they are equivalent in 
primary structure to Tyr 2 17 in B. amyloliouefaciens 
subtilisin. Equivalent amino acids of course are not 
limited to those for Tyr217 but extend to any residue 
which is equivalent to a residue in B.' amvloliaue - 
faciens whether such residues are conserved or not. 

Equivalent residues homologous at the level of 
tertiary structure for a precursor carbonyl hydrolase 
whose tertiary structure has been determined by x-ray 
crystallography, are defined as those for which the 
atomic coordinates of 2 or more of the ' main chain 
atoms of a particular amino acid residue of the 
precursor carbonyl hydrolase and B. ' amvloliouefaciens 
subtilisin (N on N, CA on CA, C on C, and 0 on 0) are 
within 0.13nm and preferably O.lnm after alignment. 
Alignment is achieved after the best model has been 
oriented and positioned to give the maximum overlap of 
atomic coordinates* of non-hydrogen protein atoms of 
the carbonyl hydrolase in question to the B. 
amyloliouefaciens subtilisin. The best model is the 
crystallographic model giving the lowest R factor for 
experimental diffraction data at the highest 
resolution available. 

2|Fo(h)|-|Fc(h)| 
R factor - ~ s | ro(h)| 
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Equivalent residues which are functionally analogous 
to a specific residue of B. amyloliguef acieris 
subtilisin are defined as those amino acids of the 
precursor carbonyl hydrolases which may adopt a 
conformation such that they either alter, modify or 
contribute to protein structure, substrate binding or 
catalysis in a manner defined and attributed to a 
specific residue of the B. amyloliguefaciens 
subtilisin as described herein. Further, they are 
those residues of the precursor carbonyl hydrolase 
(for which a tertiary structure has been obtained by 
x-ray crystallography), which occupy an analogous 
position to the extent that although the main chain 
atoms of the given residue may not satisfy the 
criteria of equivalence on the basis of occupying a 
homologous position, the atomic coordinates of at 
least two of the side chain atoms of the residue lie 
with 0.13nm of the corresponding side chain atoms of 
g. amvloliauefaciens subtilisin. The three 
dimensional structures would be aligned as outlined 
above. 

Some of the -residues identified for substitution, 
insertion or deletion are conserved residues whereas 
others are not. In the case of residues which are not 
conserved, the replacement of one or more amino acids 
is limited to substitutions which produce a mutant 
which has an amino acid sequence that does not 
correspond to one found in nature. In the case of 
conserved residues, such replacements should not 
result in a naturally occurring sequence. The carbonyl 
hydrolase mutants of the present invention include the 
mature forms of carbonyl hydrolase mutants as well as 
the pro- and prepro-forms of such hydrolase mutants. 
The prepro-forms are the preferred construction since 
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this facilitates the expression, secretion and 
maturation of the carbonyl hydrolase mutants. 

"Expression vector" refers to a DNA construct 
containing a DNA sequence which is operably linked to 
a suitable control sequence capable of effecting the 
expression of said DNA in a suitable host. Such 
control sequences include a promoter to effect 
transcription, an optional operator sequence to 
control such transcription, a sequence encoding 
suitable mRNA ribosome binding sites, and sequences 
which control termination of transcription and 
translation. The vector may be a plasmid, a phage 
particle, or. *$imply a potential genomic insert. Once 
transformed into a suitable host, the vector may 
replicate and function independently of the host 
genome, or may, in some instances, integrate into the 
genome itself. In the present specification, 
"plasmid" and "vector" are sometimes used 
interchangeably as the plasmid is the most commonly 
used form of vector at present. However, the 
invention is intended to include such other forms of 
expression vectors which serve equivalent functions 
and which are, or become, known in the art. 

The "host cells" used in the present invention 
generally are procaryotic or eucaryotic hosts which 
preferably have been manipulated by the methods 
disclosed in EPO Publication No. 0130756 to render 
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them incapable of secreting enzymatically active 
endoprotease. A preferred host cell for expressing 
subtilisin is the Bacillus strain BG203 6 which is 
deficient in enzymatically active neutral protease and 
alkaline protease (subtilisin). The construction of 
strain BG2036 is described in detail in EPO Publicatin 
No. 0130756 and further described by Yang, M.Y., et 
al. (1984) .t. ■Raeteriol. 160 . 15-21. Other host cells 
for expressing subtilisin include Bacillus subtilis 
1168 (EPO Publication No. 0130756) . 



Host cells are transformed or transfected with vectors 
constructed using recombinant DNA techniques. Such 
transformed host cells are capable of either 
replicating vectors encoding the carbonyl hydrolase 
15 mutants or expressing the desired carbonyl hydrolase 
mutant. In the case of vectors which encode the pre 
or prepro form of the carbonyl hydrolase mutant, such 
mutants, when expressed, are typically secreted from 
the host cell into the host cell medium. 



"Operably linked" when describing the relationship 
between two DNA. regions simply means that they are. 
functionally related to each other. For example, a 
presequence is operably linked to a peptide if it 
25 functions as a signal sequence, participating in the 
secretion of the mature form of the protein most 
probably involving cleavage of the signal sequence. A 
promoter is operably linked to a coding sequence if it 
controls the transcription of the sequence; a ribosome 
30 binding site is operably linked to a coding sequence 
if it is positioned so as to permit translation. 

The genes encoding the naturally-occurring precursor 
carbonyl hydrolase may be obtained in accord with the 

35 
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general methods described herein in EPO Publication 
No. 0130756. 

Once the carbonyl hydrolase gene has been cloned, a 
number of modifications are undertaken to enhance the 

5 use of the gene beyond synthesis of the naturally- 
occurring precursor carbonyl hydrolase. Such 
modifications include the production of recombinant 
carbonyl hydrolases as disclosed in EPO Publication 
No. 0130756 and the production of carbonyl hydrolase 

10 mutants described herein. 

The carbonyl hydrolase mutants of the present 
invention may be generated by site specific 
mutagenesis (Smith, M. (1985) Ann, Rev. Genet. 423 ; 

15 Zoeller, M.J., et al. (1982) Nucleic Acid Res. 10, 
6487-6500) , cassette mutagenesis (EPO Publication No. 
0130756) or random mutagenesis (Shortle, D. f et al. 
(1985) Genetics , 110 , 539; Shortle, D., et al. (1986) 
Proteins: Structure, Function and Genetics , 1, 81; 

20 Shortle, D. (1986) J. Cell. Biochem , ^30, 281; Alber, 

T. , et al. (1985) Proc. Natl. Acad, of Sci. , 812, 747; 
Matsumura, M. , et al. (1985) J. Biochem. , 260 , 15298; 
Liao, H., et al. (1986) Proc. Natl. Acad, of Sci. , 83 
576) of the cloned precursor carbonyl hydrolase. 

25 Cassette mutagenesis a,nd the random mutagenesis method 
disclosed herein are preferred. 

The mutant carbonyl hydrolases expressed upon 
transformation of suitable hosts are screened for 
30 enzymes exhibiting one or more properties which are 

substantially different from the properties of the 
precursor carbonyl hydrolases, e.g., changes in 
substrate specificity, oxidative stability, thermal 
stability, alkaline stability, resistance to 

35 
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proteolytic degradation, pH-activity profiles and the 
like. 

A change in substrate specificity is defined as. a 
difference between the kcat/Km ratio of the precursor 
carbonyl hydrolase and that of the hydrolase mutant. 
The kcat/Km ratio is a measure of catalytic 
efficienty. Carbonyl hydrolase mutants with increased 
or diminished kcat/Km ratios are described in the 
examples. Generally, the objective will be to secure 
a mutant having a greater (numerically large) kcat/Km 
ratio for a given substrate, thereby enabling the use 
of the enzyme to more efficiently act on a target 
substrate. A substantial change in kcat/Km ratio is 
preferably at least 2-fold increase or decrease. 
However, smaller increases or decreases in the ratio 
(e.g., at least 1.5-fold) are also considered 
substantial. An increase in kcat/Km ratio for one 
substrate may be accompanied by a reduction in kcat/Km 
ratio for another substrate. This is a shift in 
substrate specificity, and mutants exhibiting such 
shifts have utility where the precursor hydrolase is 
undesirable, .e.g. to prevent undesired hydrolysis of a 
particular substrate in an admixture of substrates. 
Km and kcat are measured in accord with known 
procedures, as described in EPO Publication No. 
0130756 or as described herein. 

Oxidative stability is measured either by known 
procedures or by the methods described hereinafter. A 
substantial change in oxidative stability is evidenced 
by. at least about 50% increase or decrease (preferably 
decrease) in the rate of loss of enzyme activity when 
exposed to various oxidizing conditions. such 
oxidizing conditions are exposure to the organic 
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oxidant diperdodecanoic acid (DPDA) under the 
conditions described in the examples. 

Alkaline stability is measured either by known 
procedures or by the methods described herein. A 
substantial change in alkaline stability is evidenced 
by at least about a. 5% or greater increase or decrease 
(preferably increase) in the half life of the 
enzymatic activity of a mutant when compared to the 
precursor carbonyl hydrolase. In the case of 
subtilisins, alkaline stability was measured as a 
function of autoproteolytic degradation of subtil is in 
at alkaline pH, e.g. for example, O.IK sodium 
phosphate, pH 12 at 25* or 30 4 C. 

Thermal stability is measured either by known 
procedures or by the methods described herein. A 
substantial change in thermal stability is evidenced 
by at least about a 5% or greater increase or decrease 
(preferably increase) in the half-life of the 
catalytic activity of- a mutant when exposed to a 
relatively high , temperature and neutral pH as compared 
to the precursor carbonyl hydrolase. In the case of 
subtilisins, thermal stability is measured by the 
autoproteolytic degradation of subtilisin at elevated 
temperatures and neutral pH, e.g., for example 2mM 
calcium chloride, 50mM MOPS pH 7.0 at 59'C. 

The inventors have produced mutant subtilisins 
containing the substitution of the amino acid residues 
of B. amvloliouefaciens subtilisin shown in Table I. 
The wild type amino acid sequence and DNA sequence of 
B. amyloliquefaciens subtilisin is shown in Fig. 1. 
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uocidue 


Tyr21 


FA 


Thr22 


C 


Ser24 


c 


Asp32 


Q S 


Ser33 


A T 


Asp36 


A G 


Gly46 


V 


Ala48 


E V R 


Ser49 


C L 


Met50 


C F V 


Asn77 


D 


Ser87 


C 


Lys94 


" C 


Val95 


c 


Leu96 


D 


Tyrl04 


A C D 


Ilel07 


V 


GlyllO 


C R 


Metl24 


I L 


Asnl55 


A D H 


Glul56 


Q S 


Glyl66 


C E I 


Glyl69 


C D E 


Lysl70 


E R 


Tyrl71 


F 


Prol72 


E Q 


Phel89 


A C D 


Aspl97 


R A 


Metl99 


I 


Ser204 


C R L 


Lys213 


R T 


Tyr217 


A C D 


Ser221 


A C 



Replacement Amino Acid 
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The different amino acids substituted are represented 
in Table I by the following single letter 
designations: 



Amino acid 

or residue 3-letter 1-letter 

thereof symbol symbol 

Alanine Ala A 

Glutamate Glu E 

Glutamine Gin Q 

Aspartate Asp D 

Asparagine Asn N 

Leucine Leu L 

Glycine Gly G 

Lysine Lys K 

Serine Ser s 

Valine Val V 

Arginine Arg R 

Threonine Thr T 

Proline Pro P 

Isoleucine lie I 

Methionine Met M 

Phenylalanine Phe F 

Tyrosine Tyr Y 

Cysteine Cys C 

Tryptophan Trp W 

Histidine His H 



Except where otherwise indicated by context, wild-type 
amino acids are represented by the above three-letter 
symbols and replaced amino acids by the above 
single-letter symbols. Thus, if the methionine at 
residue 50 in B. amyloliauefaciens subtilisin is 
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replaced by phenylalanine, this mutation (mutant) may 
be designated MetSOF or F50. Similar designations 
are used for multiple mutants. 

In addition to the amino acids used to replace the 
residues disclosed in Table I, other replacements of 
amino acids at these residues are expected to produce 
mutant subtilisins having useful properties. These 
residues and replacement amino acids are shown in 
Table II. 
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Residue 


Replacement 


Tyr-21 


L 


Thr22 


K 


Ser24 


A 


Asp3 2 




Ser33 


G 


Gly46 




Ala48 




Ser49 




Met 50 


L K I ' 


Asn77 


D 


Ser87 


N 


Lys94 


R Q 


Val95 - 


L I 


Tyrl04 




Met 12 4 


K A 


Alal52 


C L I '. 


Asnl55 




G1U156 


ATM] 


Glyl66 




Glyl69 




Tyrl71 - 


K R E ( 


Prol72 


D N 


Phel89 




Tyr217 




Ser221 




Met222 





Each of the mutant subtilisins in Table I contain the 
replacement of a single residue of the fi. 
amvloliquefaciens amino acid sequence . These 
particular residues were chosen to probe the influence 
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of such substitutions on various properties of B. 
amvlolicruefacien subtilisin. 

Thus, the inventors have identified Metl24 and Met222 
as important residues which if substituted with 
another amino acid produce a mutant subtilisin with 
enhanced oxidative stability. For Metl24, Leu and He 
are preferred replacement amino acids. Preferred 
amino acids for replacement of Het222 are disclosed in 
EPO Publication No. 0130756. 

Various other specific residues have also been 
identified as being important with regard to substrate 
specificity. These residues include Tyrl04, Alal52, 
Glul56, Glyl66, Glyl69, Phel89 and Tyr217 for which 
mutants containing the various replacement amino acids 
presented in Table I have already been made, as well 
as other residues presented below for which mutants 
have yet to be made. 

The identification of these residues, including those 
yet to be mutated, is based on the inventors 1 high 
resolution crystal structure of B. amyloliquefaciens 
subtilisin to 1.8 A (see Table III), their experience 
with in vitro mutagenesis of subtilisin and the 
literature on subtilisin. This work and the x-ray 
crystal structures of subtilisin containing covalently 
bound peptide inhibitors (Robertus, J.D., et al. 
(1972) Biochemistry 11. 2439-2449), product complexes 
(Robertus, J.D., et al. (1972) Biochemistry 11, 
4293-4303), and transition state analogs (Matthews, 
D.A., fit al (1975) J. Biol. Chem. 250, 7120-7126; 
Poulos, T.L., et al. (1976) J- Biol. Chem. 251, 
1097-1103), has helped in identifying an extended 
peptide binding cleft in subtilisin. This substrate 
binding cleft together with substrate is schematically 
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diagramemed in Fig. 2, according to the nomenclature 
of Schechter, I., et al, (1967) Biochem Bio. — El^. 
Commun. 27, 157. The scissile bond in the substrate 
is identified by an arrow. The P and P 1 designations 
refer to the amino acids which are positioned 
5 respectively toward the amino or carboxy terminus 
relative to the scissle bond. The S and S 1 
designations . refer to subsites in the substrate 
binding cleft of subtilisin which interact with the 
corresponding substrate amino acid residues. 
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Atomic Coordinates for the 
Apoenzyme Form of 2, Amvloliauefaciene 
Subtilisin to 1.8AResolution 



5 
4 
4 
* 
4 
4 
4 
T 
T 
• 

• 

9 
t 

10 
14 

10 
14 
11 
11 
11 
11 
12 
12 
12 
12 
12 
11 
11 
14 
14 
14 
14 
14 
II 
14 
14 
14 
14 
11 
It 
IV 



l 414 a 

1 41 A C 

1 ALA CO 

2 tLN CA 
2 6L« 0 
2 6l* C6 

2 41 ■ 0£1 

i set a 

J SEt c 

3 SII CI 

4 VAL ■ 
4 VAL C 
4 VAL CI 

4 VAL C62 

5 mo ca 
s no o 

MO CC 
Tit m 
TTt C 
TTI CI 
TTI C01 
TTI CEl 
III cz 

CLT ■ 
4LT C 
VAL « 
VAL C 
VAL CI 
VAL C62 
SCO CA 
Stt 0 
SEt OC 
CL* CA 
6L* 0 
4LM CC 
U« 0E1 
SLE 
ILE 



II 
C 



ILE CI 
XLE C62 



LTS 
LVS C 
ITS CO 
ITS CO 
LTJ mi 
ALA CA 
ALA 0 

mo a 

MO C 
MO CO 
MO CO 
ALA CA 
ALA 0 
LEU 0 
LEU C 
LEU CO 
LEU C01 

mil a 
ais c 

MIS CI 



IT his mux 

IT HIS CE1 

>o . ttt a 



19.494 
11.731 
21. Of* 
17.21* 
11.715 
IS. 3*1 
13.021 
17. 477 
14.735 
11.511 
14.991 
14.129 
14.001 
14.037 
15.314 
14.115 
13.141 
14.343 
15.359 
17.124 
11.437 
10.535 
10.222 
14.444 
12.400 
22.441 
22.343 
11.745 
10*991 
14.419 
14.212 
14.142 
13.944 
22.715 
14.295 
14.554 
11.425 
20.209 
9.132 
9.142 
11.272 
10.454 
11.257 
12.543 
24.474 
9.325 
9.331 
11.332 
11.714 
13.442 
12*211 
11.379 
20.001 
9.013 
7.912 
4.744 
3.111 
0.445 
9.510 
9*701 
9.931 
9.224 
11.443 



53.195 

90.955 

51.511 

49.001 

47.145 

47.305 

41.412 

47.205 

44.911 

45.132 

43.444 

4J.934 

42.422 

42.246 

41.413 

35.243 

+3.215 

39.240 

34.975 

37.323 

35.452 

34.970 

33.154 

37.362 

34.535 

37.529 

34.433 

31.900 

99.919 

35*342 

93*014 

04.7+7 

32.434 

90.442 

31.417 

33.041 

32*575 

31.792 

32.449 

32*455 

32.115 

33*004 

30*444 

21*517 

27.410 

35*191 

35*104 

35*951 

95.557 

04*510 

35*934 

93*450 

33*710 

04.131 

05*925 

94*423 

13*234 

04.021 

97.911 

99.209 

99.027 

39.914 

97*933 



-21.754 

•21.324 

-21.113 

•2l.*34 

-21.491 

•21.927 

-22.147 

-19.152 

-19.490 

-10.049 

-19.725 

-10*290 

-20.122 

-22.116 

-14.027 

-17*144 

-15.921 

-15.417 

-15.521 

-14*134 

-14.344 

-14.653 

-15.421 

-24*430 

-15.470 

-14.541 

-11*735 

-11*547 

-17*733 

-19.542 

-19.901 

-20*351 

-14.074 

•37.413 

-14*511 

-12*744 

-17.470 

-19*405 

-17.475 

-13.941 

-20.277 

-22*522 

•22.214 

-22.159 

-20*935 

-22.431 

-24*901 

-23*093 

-24*317 

-24*692 

-22*751 

-27*347 

-29*271 

-27*249 

-20.321 

-24*491 

-27*009 

-27.922 

-29*090 

•27*652 

-25*272 

-24.144 

-90*022 



5 
5 
5 
S 
4 
4 
4 
4 
4 
4 
7 
T 
0 
0 
0 
9 
9 
9 

10 

10 

10 

20 

20 

11 

11 

21 

11 

12 

12 

12 

32 

13 

13 

13 

24 

24 

34 

IS 

19 

15 

14 

14 

14 

14 

IT 

IT 

IT 

IT 

IT 

19 



1 ALA C« 

1 ALA 0 

2 sl« a 

2 CLM C 

2 6L* CI 

2 M CO 

2 CLN ftC2 
9 SCI CA 
9 SEt 0 

3 SEt 06 

4 VAL. CA 
4 VAL 0 

4 VAL CC1 

mo a 

MO C 
MO CI 
MO CO 
Tit CA 
TTI 0 
TTI CC 
TTt C02 
TTI CE2 
TTt OH 
CLT CA 
CLT 0 
VAL CA 
VAL 0 
VAL C61 

SEt a 

SEt C 
SEt CI 

cl« a 

BLR C 
61* CO 
61* CO 
6L* *E2 
ILE CA 
ILE 0 
ILE C61 
ILE CD1 
LTS CA 
LTS 0 
LTS CC 
LTS CE 
ALA a 
ALA C 
ALA CO 
MO CA 
MO 0 
MO CC 
ALA B 
ALA C 
ALA CO 
LEU CA 
LEU 0 
LEU CC 
LEU CD2 

ais ca 

ais a 

ais cc 

ais coi 

ais ar2 

SEt CA 



19.911 
11.374 
10.241 
17.175 
14.125 
13.912 
14.115 
17.950 
15.590 
17.412 
15.944 
17*123 
14.174 
15.239 
15.501 
14.150 
14.944 
14.421 
15.224 
10.021 
17.494 
17.115 
10.312 
13.211 
11.747 
11.777 
11.139 
11.104 
13.441 
14.111 
15.924 
14.115 
12.417 
14.125 
14.414 
14.952 
10.373 
9.173 
9*044 
7.511 
11.311 
10.170 
12.213 
13*023 
10.119 
10.024 
0.015 
11.915 
11. TT| 
19.920 
11.940 
10*002 
11*992 
T.T91 
7.342 
3.794 
4*494 
0*090 
9*107 
9*115 
0*090 
0*079 
11.119 



91.774 
91.197 
49.116 
47.704 
41.740 
47.742 
44.917 
45.941 
45.352 
44.210 
42.419 
41.171 
41.572 
42.104 
39.905 
41.110 
42.914 
37.103 
33.943 
35.147 
34.901 
33.539 
31.131 
34.440 
35.471 
37.323 
35.714 
31.193 
34.311 
33.920 
93.432 
13.117 
31.117 
32.115 
31.911 
30.940 
31.904 
31.333 
34.117 
34.441 
32.119 
32.793 
29.930 
27.447 
34.131 
35*714 
34.195 
94.430 
34.947 
-94.971 
94.234 
93.795 
01.949 
04.951 
04.124 
99.445 
92.217 
91.151 
91.422 
99.211 
90.924 
99.321 
04.739 



-21.945 

-21.173 

-22.»4t 

-21.992 

-22.449 

-22.910 

-23.924 

-19.437 

-14.229 

-17.049 

-19.419 

-11.114 

•20.741 

-17.331 

-14.249 

-15.243 

•17.417 

-15.715 

-14.235 

•15.155 

-14*971 

•14.379 

-15.994 

•14.374 

-15.113 

-17*136 

-19.470 

-19.943 

-11.775 

-11.945 

-19.505 

-17.442 

-17.277 

-11.410 

-13.147 

-12.251 

-11.112 

-24.110 

-11.949 

-17.923 

-21.722 

-23.414 

-21.423 

-21.146 

-21.991 

-23.963 

-21.365 

-25.120 

-27.445 

-23.221 

-24.129 

-21.932 

-27.942 

-27.921 

-29.311 

-24.522 

-24.213 

-21.931 

-31.996 

•24.242 

•25.494 

-24.311 

-31.322 
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ii 

SI 

II 
If 
It 
It 
It 
II 
to 
II 
II 
II 
II 

II 

II 

It 

II 

II 

II 

II 

I* 

I* 

14 

II 
II 
If 
II 
II 
II 
II 
II 

IT 

IT 

II 

IT 

II 

II 

II 

II 

II 

It 

SI 
II 
10 
90 
II 
II 
II 
II 
II 
II 
II 
It 
II 
II 
II 
I* 
I* 
11 
II 
II 
II 
II 



111 c 
III CO 
ft l* N 

•l« € 
Ilk Cft 
•l* CD 

il« mi 

•IT CI 

ht e 

Til CI 

tii e 

TTI CI 
TTI COI 
TTI ClI 
TTI 0* 



TMl CI 
TM 0 
Tut 061 



OUT 
SIT 
III 
III 



k 

C 
« 

c 

III CI 
AM ft 
AM C 
Alt Ct 
AH 60! 
f At ft 
VAl C 
VII CI 
VAl Ct* 
ITS CI 
ITS 0 
iTl Ct 
US CI 
VAl H 
Til C 
fit Ct 
til Ct! 
Ill CI 
Ik* 0 
TIL M 
til C 
til CI 

tii eta 
lit CA 
in e 
III c&i 
in coi 

At* CA 
Alt 0 
A!» CI 
At* 001 
III CA 

lit 0 
III DC 
•IT CA 
ALT 0 
III CI 
III 0 
lit Ct! 
Ill COI 
Alt CA 



It. lit 

u. m 
t.iio 

T.IW 
T.lll 
4.12) 
T.)»2 
A. lit 
4.2*1 
4.111 
1.422 
I.TTI 
I. HO 

i.m 

1.901 

4.1*1 

I.IIT 

4.)19 

1.1)1 
-0.117 
•0.021 
•I-Ill 
-I.T14 
•I. tit 
-I. Oil 

-i.m 

•4,1*1 
-4. ITT 
-4. TII 
-1.T14 
O.HI 
-A. Ill 
•A. 40* 
•1.044 
-10.104 
-4,111 
•4, TII 
•1.12* 
•I.**T 
•I.T4T 
-4. 44* 
-4. BIT 
-I.TM 
•1*114 
-I. til 
•I. 121 
-I.I2I 
•T.29I 
•I.41T 
•1.144 
•4.I1T 

-1.413 
-•.Oil 
-l.lll 
•l.TO* 
l.tll 
-2.111 

-1.144 

9.209 

-0.12T 

•t.iio 

-0.1*2 
I. Ill 



14. 121 
II. TII 
11.411 

14.111 
11.141 
II. TOT 
10. Ill 
II.IIT 
I1.lt* 
IT. 1)1 
11.074 
II. Tl* 
I4.T14 
. 14.2*1 

14.2*1 

40. m 

41. T21 
4I.4IT 
40.211 
41.411 
41.1*7 
42.424 
4J.12C 
41.412 
42. ITl 
4J.127 
4J.T4T 
*2.44t 

42.112 
40.101 
11.174 
41.124 
41.171 
44.171 
41.417 
41.4*2 
41.111 
42.44* 
41.101 
44.1)0 
42.041 
41.011 
41.401 
41.110 
41.11* 
44.144 
41.111 
41. TOT 
42,014 
44.4AT 
44.410 
4I.T02 
44.420 
41.117 
12.114 
10.021 
I2.T2I 
10. til 
12.4)1 
S4.AII 
10.210 
41.411 
11.411 



•12. 1*1 

-u. m 

-II.14J 
•II. 101 

•12.210 
•12.111 - 
•10.21* 
-12.11* 
•12.221 
•21.7*1 
-27.7*4 
-10. TII 
•II.IIT 
•12.111 
•14.2*0 
-2T.il* 
•21.129 
-21.117 
-I*.*" 
•II. Ill 
•IT. ITl 
-27.1*4 
-21.120 
-IT. »11 
-24.101 
•21. TOO 
•II. Oil 
-21.202 
-22.1IT 
-21.121 
•21.011 
•2I.1TI 
•11.411 
•22.410 
-21.117 
•10.209 
•14.121 
•1T.0I2 
-11.171 
•14.4)1 

-11.104 

•1I.0T2 
-10. Ill 
•12.141 
- •II.IOT 
•l.*TI 
-♦.117 
•t.Ttl 
•t.TlT 
•A.III 
•I. It2 
•1.171 
•1.1)0 
•4.101 
-I.IIJ 
-4.TT4 
•1.141 
•0.TA1 
•10. til 
•11.T44 
-12.017 
•11.424 

•11*1)2 



II 
II 
11 
It 
It 
It 
10 
10 
II 
II 
II 
II 
II 
II 
12 
12 
12 
12 
2) 
II 

14 

14 
14 
II 
II 
II 

It 

14 

IA 

14 

IT 

IT 

IT 

IT 

IT 

II 

II 

II 

It 

It 

It 

10 
10 
10 
II 
II 
II 
II 
12 
12 
12 
12 
1) 
II 
I) 

14 
14 
II 
91 
II 
II 

14 
14 



111 0? 
tl* CA 
CL* 0 
SIM CO 

tl* ttl 

Alt * 
•IT C 
TTI II 
TTt C 
TTI CI 
TTI COI 



0 

00 



tti cei 

TTt tl 
T«t « 
TM C 
TMl Ct 
TM CC2 
AIT CA 
61T 0 
111 C* 
111 
1M 

AS* CA 
111 0 
ASK CO 
AIM 002 
Vll C* 
VII 0 
VAL COI 
ITl N 
tti C 
IT) CI 
ITS CO 
IT! Ul 
VAL CA 
VAL 0 
VAL CC1 
ALA II 
ALA C 
ALA Ct 
VAl CA 
VAl 0 
VAL CC1 
HE * 
UI c 
III CI 

tit eci 

Alt * 
A!» C 
A|» Ct 
At* 801 
111 ■ 

in c 
lit ct 

•IT *J 
•LT C 
III % 
III c 
lit CO 
III CC2 
At* « 
Alt C 



11.541 

11.121 
I.II2 
A. 217 
T.1TI 
t.Tlt 
T.IOI 
1.111 
1.202 
4.171 
1.411 
l.Tt* 
l.IOt 
2.001 
1.902 
1.011 
I.))) 
A.4T* 
0.001 
•2.01) 
•0.017 
-2.115 
6.9*1 
•4.119 
-4.2)1 
-4.940 
-4.T4T 
•4.AT4 
•l.tll 
•4.1*0 
•1.910 
•t.lll 
•T.IIO 
•1.121 
•1.404 
•4,49T 
•4.101 
-2.444 

•1.414 

•4.TI0 
•T.1T2 
•1.14A 
-4.111 
-0.9,1* 
•4.11* 
•4.I4A 
-4.497 
•T.2TI 
•4.144 
-1.0T1 
•1.401 
«.t)4 
-t.lll 
•1.112 
-1.421 
•I. IT) 
•l.tll 
•1.941 
1,1*1 
•1.141 
1.141 
1.114 
1.2*1 



11.112 
1*.4I0 
14.1*1 
11.071 
12.102 
12.111 
IT.I21 
11.192 
IT.I01 
11.112 
14.411 
OA. 112 
II. TIT 
14. TII 
19.490 
40.122 
41.7*1 
41. in 
4C.»eo 

42.191 
41.017 
41.101 
41.411 
41.417 
AI.AAI 
44.170 
41.4*1 
41. ATI 
41.411 
19.102 
41.111 
42.172 
41. ttl 
41.102 
4A.2I1 
42.110 
49.111 
42.111 
41.127 
44.110 
44.107 
44.1*2 
44.44* 
41.101 
44.111 
44.1)1 
41.T74 
44.1)1 
4A.191 
4T.0I9 
44.119 
44.192 
40.112 
11.174 
41.122 
01.T41 
11.441 
11.411 
11.919 
•1.414 
11.T41 
14.21) 
10.114 



•11.014 
-10.191 
•12.071 
-14.211 
-12.121 

•11.444 

•12.117 

•11.110 

-51.741 

-28.121 

•19.441 

-11.211 

•12.44* 

•11.047 

•11.211 

-14.24* 

•17.411 

•20.229 

•21.142 

-21.1)0 

-21.111 

-2I.II0 

•19. TII 

-IT. HI 

•21.190 

-29.111 

•29.194 

•24.14) 

•12.499 

-22.141 

-22.I01 

•19.041 

•11.149 

•12.120 

•24.2*4 

-1T.I1T 
-14.817 
•14.119 
-It. Ill 
-11.111 
•14.111 
-11.110 
-II. ITl 
•11.010 
•t.ITT 
•T.I4A 
•0.101 
•T.lll 
•T.22T 
•I. TII 
-1.992 

-a. it* 

-1.114 

-i.tii 

•i.m 

•T.H4 
•1.IIT 
-11.102 
•SI. 141 
-II.IIT 
-11.142 
-It. IT! 
-11.112 
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34 ASP 0 

J* AS P CC 

3* AS* 902 

37 SEE CA 

37 SEI 0 

IT Stl OC 

If Sffl CA 

91 Sfi D 

31 Stl 04 

99 MIS CA 

J* MIS 0 

99 MIS CC 

39 MIS' CD2 

39 MIS Mil 

49 910 CA 

49 MO 0 

49 no CC 

4} AS* ■ 
4S9 QD1 
ASP CI 
ASP C 
LtU M 
LIU C 
LEU Ct 

leu cot 

ITS M 

ITS C 



41 
41 
41 
42 
42 
42 
42 
43 
43 



43 LTS Ct 

43 LTS CD 

43 LTS MX 

44 VAL CA 
44 f At 0 

44 VAL C01 

45 ALA « 
4$ ALA C 
4» ALA CI 
44 CLT CA 
44 CLT 0 
47 CLT CA 
4T 61 T 0 
41 ALA CA 
41 ALA 0 
49 Sfl • 
49 SEA C 
49 SCt CA 
19 MET H 
II MET C 
10 MET CA 
99 MET SO 

91 VAL M 

92 VAL C 

91 9AL Ct 

92 9AL CC2 
•2 M5 CA 
92 »M 0 

92 no ct 

13 Stl M 

93 Sit C 
»» stt Ct 

94 CLU t 
3* CLU C 
9* CLU Ct 
44 cum rn 



9.904 
4*339 
9.449 
1.113 
2.94S 
-1.910 
4.241 
4.943 
9.314 
4.437 
9.T3I 
0*914 
9.749 
9.914 
7-911 
1.132 
19.033 
1.411 
39.325 
9.799 
7.311 
4.115 
2.924 
4.421 
4. SIS 
3.911 
9.437 
2.021 
0.911 
9.337 
-1.407 
•2.423 
•2.724 
•3.494 
•5.141 
•4.931 
-7.912 
-$.934 
•1.014 
-9.911 
-19.255 
-9.944 
-11.149 
-10.947 
-9.092 
-19.135 
-11.443 
-12.912 
-13.441 
-19.427 
-19.430 
-A. 443 
-7.744 
-32.372 
-11.771 
-13.513 
-19.442 
-9.429 
-9*994 
-9.254 
-7.747 
-4.194 

-A . AAA .. 



95.471 

57.999 

57.277 

57.221 

59*303 

99.133 

99.505 

99.251 

99.945 

94.574 

55.071 

94.409 

94.345 

53.910 

54.497 

95.997 

57.495 

54.321 

91*395 

92.239 

52.143 

92.103 

92.907 

92.139 

91.544 

92*135 

52.154 

52*919 

52.942 

91.757 

52.439 

93.994 

52.941 

91*951 

52*507 

59.910 

92.137 

92*904 

92*244 

53.491 

92.979 

93.729 

93.547 

52.994 

94.991 

92.997 

91*952 

50.919 

49.119 

52.749 

94.942 

93.155 

91*915 

59*933 

99*221 

94.193 

94*994 

59*243 

97*797 

97*923 

97.993 

94*599 

4* . *U>* 



-13.179 
-19.104 
-10.213 
-14.312 
-14.151 
-13.979 
-14.417 
-15.215 
-12.234 
-15.291 
-17.419 
-14.454 
-13.319 
-13.109 
-19.131 
-29.571 
-17.992 
-11.415 
-20.429 
-19.224 
-11.131 
-11.551 
-19.374 
-17.901 
-14*511 
-19.944 
-20.911 
-22.149 
-24.339 
-24.419 
-19.745 
•29.434 
-14.592 
-19.971 
-20.0S3 
-21.319 
-29.001 
-14.031 
-14.311 
-14*115 
-11*312 
-9*723 
-9.937 
-4.713 
-7.029 
-3*932 
-3*541 
-4.994 
-7.254 
-3*422 
-1*997 
-2*999 
-2.192 
-9.921 
-9.923 
0*995 
0*299 
-9*324 
2*919 
-1*393 
-3*795 
-2*194 
«.«.%>• 



94 

94 

37 

37 

37 

39 

39 

39 

39 

39 

39 

39 

39 

40 

40 

40 

40 

41 

41 

41 

41 

42 

42 

42 

42 

43 

43 

43 

43 

44 

44 

44 

4* 

45 

45 



ASP CA 
AS* 001 
Sit * 

Itt c 
SEE CA 
Sfl M 
Stt C 
Sit CI 
MIS M 
MIS C 
MIS CI 
HIS M01 
MIS Ctl 
MD M 
990 C 

rto C9 
910 CO 
AS? 002 
AS? CC 
ASP CA 
ASP 0 
LEU CA 
LtU 0 
LEU CC 
LtU C02 
LTS CA 
LTS 0 
LTS CC 
LTS CE 
TIL M 
VAL C 
VAL CO 
VAL CC2 
ALA CA 
ALA 0 



44 CLT M 

44 CLT C 

47 CLT M 

47 CLT C 

49 ALA M 

49 ALA C 

49 ALA C4 

49 Stt CA 

49 Stt 0 

49 SEE OC 
99 MET CA 

50 MET 0 
99 MET CC 
99 MET CE 

51 VAL CA 

51 VAL 0 

91 VAL CC1 

52 920 M 

92 910 C 
92 910 CI 

92 9 10 CO 

93 ill CA 
99 Stl S 

93 Stt OC 

94 CLU CA 
94 CLU 0 
94 CLU CC 
44 CI ii MM 



9.712 
9*795 
1*394 
2*977 
-4.993 
3*143 
9.444 
4.742 
9.454 
4.411 
4.437 
9*795 
9.970 
7.907 
9.154 
9.247 
9*999 
11.149 
19.473 
0.445 
7.394 
4.992 
3.993 
9.192 
9*273 
2*993 
0.594 
0.495 
-9.190 
-9.191 
-2.571 
-1.410 
-0.197 
-4*419 
-4.793 
-3.919 
-4.997 
-1.992 
-9.179 
-9.221 
-9.799 
-11.551 
-9.752 
-11.972 
-9.97T 
-11.952 
-11.997 
-11.912 
-12.901 
-9.941 
-19.237 
-7.992 
-11*421 
-11*499 
-13*499 
-22*244 
-9*999 
-7.479 
-1.254 
-7*294 
-7*933 
-9*249 
• 1.444 



99.729 

97.974 

94.422 

31.99$ 

59.941 

59.414 

54.795 

40.435 

57.399 

94.401 

95.203 

54.354 

53.930 

44.934 

93.210 

97.533 

57.452 

90.399 

91.397 

92.959 

59.947 

92.147 

94.143 

91.343 

49*977 

92.495 

99.920 

92.434 

92.594 

53.935 

52*917 

53*331 

53.194 

51.977 

53.915 

52.354 

52.443 

52.454 

32*797 

92.444 

92.475 

52.190 

53.335 

53.477 

54.255 

51.949 

91.394 

49.443 

94.111 

93.174 

99.437 

13.379 

94.493 

97.123 

99.99« 

93*424 

97*992 

99*224 

94*921 

97*449 

94*243 

94.999 

4t.4*4 



-19.314 
-11.423 
-13.111 
-14.949 
-14.711 
-24.911 
-14.512 
-13.391 
-14.992 
-14.771 
-14.515 
-15.541 
-15.139 
-17.1B7 
-19.317 
-19.141 
-14.774 
-11.441 
-19.211 
-11.944 
-11.977 
-11.444 
-19.499 
-13.944 
-14.359 
-29.721 
-19.120 
-22.910 
-25.240 
-19.490 
-19.731 
-27.313 
-14.553 
-29.910 
-29.793 
-11.741 
-14.539 
-13.793 
-13.572 
-12*339 
-9.941 
-11.417 
-7.452 
-4.401 
-5.450 
-4.974 
-2.575 
-4.399 
-9.903 
-2.947 
-2.412 
'0.43; 
-1.414 
-9.444 . 

0.244 
-0.175 
9.412 
-9.911 
2.127 
-2.421 
-4.379 
-9.927 
-1.44ft 
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34 
35 
35 

55 
ft* 

94 

54 

54 

57 

37 

51 

57 

51 

SI 

51 

51 

51 

55 

55 

55 

55 

55 

40 

40 

40 

40 

41 

41 

41 

41 

42 

42 

42 

42 

45 

43 

43 

44 

44 

44 

44 

44 

45 

45 

44 

44 

44 

4T 

4T 

47 

47 

4? 

41 
41 
41 
45 
45 
74 
74 
71 
71 
71 



O.V 911 

TMO 0 

TMt Ml 
ASM ■ 
ASM 001 
45 M CO 
ASH C 

no n 

rto Co 

MO CA 

rto o 

Ml CA 
ME 0 
PltE CC 
Pnl C02 
ME CE2 
CLB M 
(Lit C 
til CI 
CLM CO 
CLM ME2 
ASP CA 
ASP 0 
ASP CC 
ASP- 002 
ASM M02 
ASM CC 
ASM CA 
ASM 0 
ASM CA 
ASM 0 
ASM CC 
ASM «02 
SCI CA 
SEI 0 
SEI DC 
MIS CA 
MIS 0 
HIS CC 
MIS C02 
MXS ME 2 
CLT CA 
CLT 0 
7NI CA 
TMI 0 
TNI 0C1 
MIS M 
MIS C 

Hi 5 Cm 

MIS B01 
MIS CE1 
7AL ■ 
MAt € 
9AL Ct 
MIL C12 
ALA CA 
ALI 0 
CLT ■ 
CtT C 
Tut a 

T«l C 
T*0 CM 



-3.500 
-9.433 
-9.433 
-9.815 
-7.412 

-5.151 
-6.012 
-4.342 
-7. 134 
-5.475 
-3.555 
-2.7*7 
-0.433 
-3.513 
-5.211 
-4.154 
-2.444 
-0.107 
-1.442 
-1.790 
-2.353 
0.051 
2.127 
2.077 
2.515 
-1.344 
-1.040 
2.557 
2.533 
2.177 
4*351 
2.371 
2.422 
5.109 
3.393 
4.171 
3.954 
3*141 
3.144 
4.054 
3.554 
1.552 
2.230 
4.444 
5.333 
3.437 
3.415 
4.491 
7.304 
0.590 
9.057 
4.092 
5*854 
2.939 
3.319 
3.0)7 
4.021 
5.3*8 
T.44» 
4.420 
4.224 
7.219 



03.777 

31.121 

37.919 

40.310 

51.403 

31.947 

99.494 

37.094 

94.241 

94.433 

34.941 

94.120 

94.577 

37.497 

34.941 

57.430 

37.093 

97.119 

54.403 

91.440 

40.157 

55.40S 

54.792 

95.S50 

52.534 

91.141 

57.747 

57.470 

35.734 

54.042 

52.340 

51.313 

50.103 

50.200 

51.494 

49.790 

50.491 

40.059 

44.974 

44.021 

45.194 

43.320 

41.244 

40.071 

39.117 

40.703 

32.425 

40.443 

44.1*1 

47.071 

44.907 

44.491 

45.749 

44.349 

44.252 

43.705 

44.440 

45.913 

44.702 

43.370 

44.431 

42.304 

42.070 



4.271 
-3.441 
-7.010 
-3.410 
-4.477 
-10.337 
-0.200 
-0.303 
•9.250 
-10.272 
-9.332 
-9.943 
-11.222 
-10.400 
-13.357 
-13.439 
-14.274 
-0.990 
-7.000 
-7.019 
-5.150 
-4.742 

-4.304 
-5.231 
- -4.300 
-7.030 
-2.347 
-2.399 
-2.700 
-0.902 
-1.709 
-1.770 
-9.497 
f .401 
-4.709 
-4.249 
-3.410 
-4.93S 
-7.100 
-3.726 
- -3.135 
-3.340 
-7.030 

-10.134 
-9.934 

-11.441 
-9.404 
-9.274 

-10.143 
-0.044 
-0.274 
-0.299 
-9.731 

-ii.745 

-3.304 
•0.900 
-13.429 
-15.545 
-13.914 
-13.021 
-14.130 
-31.34 3 
-13.191 



93 


vwi 


a 


-0.971 


98.291 


-4.249 


95 


7Mt 


c 


-8.744 


38.139 


-6.779 


93 


TMt 


CO 


-10.904 


39.200 


-9.303 


33 


TMl 


CC2 


-11.432 


59.143 


-4. Oil 


34 


ASM 


«D2 


-4.930 


61.179 


-9.881 


54 


ASM 


CC 


-3.273 


99.923 


-9.333 


54 


ASM 


CA 


-4.742 


90.423 


-0.290 


54 


ASM 


0 


-5.194 


94.044 


-7.470 


37 


P80 


CC 


-7.123 


55.257 


-11.177 


57 


PCO 


CD 


«G.644 


54.179 


-10.235 


57 


PAD 


C 


-4.301 


55.002 


-9.964 


50 


PME 


a 


-3.990 


54.262 


-10*491 


90 


*ME 


C 


-1.712 


57.129 


-10.253 


50 


*ME 


C9 


-2.943 


97.502 


-12.423 


50 


OME 


CD1 


-3.734 


55.700 


-14.059 


50 


*NE 


CE1 


-4.722 


95.255 


-14.920 


90 


*«E 


Ct 


-5.949 


55.939 


-15.931 


59 


CLB 


CA 


-1.172 


97.303 


-7.934 


59 


CLM 


0 


-1.439 


54.003 


-4.113 


59 


CLM 


CC 


-4.942 


59.241 


-4.034 


59 


CLtt 


0E1 


-1.404 


41.288 


-4.036 


40 


ASP 


B 


• •410 


55.895 


-7.211 


40 


ASP 


C 


1.431 


55.247 


-3.090 


40 


ASf 


CO 


1.394 


53.744 


-7.100 


40 


ASP 


DDI 


1.744 


52.337 


-5.190 


41 


ASM 


M 


4.959 


35.265 


-3.930 


41 


ASM 


901 


6.444 


90.546 


-2.073 


41 


asM 


CO 


4.531 


94.401 


-1.704 


41 


ASM 


C 


2.291 


54.632 


-1.940 


42 


ASM 


a 


2.210 


33.434 


-2.440 


42 


ASM 


C 


4.124 


31.093 


-2.479 


42 


ASM 


CO 


1.783 


51.319 


-1.421 


42 


ASM 


DDI 


2.433 


49.077 


-1.343 


43 


SEI 


M 


4.152 


52.104 


-3.741 


43 


5EI 


C 


5.071 


50.254 


-5.209 


43 


5EI 


CO 


4.523 


51.950 


-4.912 


44 


MIS 


« 


4.202 


49.475 


-4.439 


44 


MIS 


C 


3.344 


47.759 


-4.241 


44 


MXS 


CO 


3.104 


47.901 


-3.747 


44 


MXS 


SOI 


2.107 


45.247 


-4.241 


44 


MIS 


CE1 


2.414 


43.944 


-4.054 


45 


CLT 


81 


2.207 


41.420 


-6.387 


45 


CLT 


C 


2.392 


48.434 


-9.937 


44 


TMtt 


M 


3.233 


49.639 


-8.032 


44 


TWA 


C 


5.009, 


49.909 


-10.291 


44 


TMl 


C3 


4.744 


51.511 


-9.447 


44 


TMM 


CC2 


5.534 


52.070 


-10.049 


67 


«IS 


CA 


6.703 


47.341 


-9.450 


47 


MXS 


e 


4.449 


65.638 


-11.150 


#7 


MIS 


CC 


8.593 


66.275 


-0.148 


47 


MIS 


CD2 


9.904 


44.470 


-8.974 


47 


MXS 


MJ«2 


10.470 


45.514 


-8.186 


40 


VAL 


CA . 


4.142 


44.407 


-10.244 


45 


9AL 


0 


4.114 


43.942 


-12.535 


40 


VAL 


CC1 


1*940 


43.240. 


•19.920 


49 


ALA 


M 


1*373 


44.049 


-12.113 


49 


ALA 


c 


4*'l93 


44.390 


-14.411 


49 


ALA 


C9 


2. 332 


47.031 


-13.384 


70 


CLT 


CA 


6.995 


44.905 


-14.470 


70 


CLT 


0 


7*494 


43.194 


•16.119 


71 


TR9 


CA 


7.177 


43.019 


*14.444 


73 


TMM 


9 


4.602 


42.920 


-16.495 


71 


TMM 


BC1 


8.191 


42.992 


•12.390 



0251 44C 



7i 


TKI 


Ct2 


7.774 


44.943 


-13*994 


72 


7 At 


CA 


3*974 


42.491 


-34*494 


72 


f At 


4) 


4*341 


42*314 


-3 9*444 


72 


VAL 


CCI 


1*512 


42*499 


-37*179 


73 


4L4 


m 


4. S3* 


44.417 


-17*519 


73 


4L4 


c 


9.433 


44.333 


-19.335 


73 


4L4 


c» 


3*107 


49*441 


-19*433 


74 


4L4 


CA 


7*474 


47*991 


-19*959 


14 


4L4 


0 


1.954 


44*440 


-21*054 


7$ 


LEU 


■ 


7.450 


41*784 


-21.639 


7$ 


Leu 


c 


9*192 


49*549 


-22.944 


79 


LEU 


CA 


7.948 


90.471 


-22*409 


73 


LEU 


CDS 


4.073 


32*434 


-22*390 


74 


ASK 


II 


9.147 


41.103 


-24*199 


74 


AS« 


001 


10. tso 


45.940 


—27*924 


74 


ASH 


CA 


If .110 


44.451 


-25.994 


74 


ISM 


c 


13.713 


49.944 


-2 5*443 


77 


ASB 




11*104 


49.444 


-25.071 


t * 


ASM 
« # *j 


r 

V 


13.707' 


9 ft • w ft 9 


♦ 9 • 9% 9 


77 


ASM 


CA 


11* 335 


9 ft • W « •> 


— 7 « 1 « 7 


T7 
f 1 


AIM 


001 

M V ft 


ft* . VJ* 


•II 111 
7 ft • J»f> 


—ft 1*9* • 


9* 
• V 


4* ft 


n 


11.1 2 £ 
• * • A ft 9 


4 7 917 
7ft . ft ■ « 


— 7 < 1 IX 

— * 7 • 1 ■*> 


f ■ 


tr t 

*c ■ 


r 

V 


11 n a 

A 9 a • 1 V 


1 7 Tl 9 

> ft » f — ft 


—13*4 34 


f V 


*> * ■» 


^4 


I 1 . 4A 1 


C 1 flit 


_ > r c * 7 
—ft 5*317 


9a 

• 9 


tig 


*j 




11 , III 

>* * > 4 5 


— ZZ«?Z9 


9* 

f 9 






11.11 T 
• 9 • • 1 1. 




— £ 0*2 30 


9 W 


fl c 
» fc c 


C4 


«/ All' 


li. • 91 


_ 9 A a* 7 

— * B .47 f 


77 




CC2 


11. M 7 


41 17D 


— 71 &1 ? 
— * I * VW ft 


• a 

• w 


4L V 


n 


11.411 


41 Tab 


.it * * • 
— * • • 7 • I 


90 


4LV 


c 


14.412 


49*449 


—19.219 


§1 


VAL 


H 


13.S13 


49.744 


"l r *liv 


tl 


VAL 


c 


12*311 


44*919 


•11.217 


tl 


VAL 


CA 


13*001 


44.755 


-14*477 


• 1 


VAL 


CC2 


11*431 


47*241 


-14.231 


42 


Lf U 


CA 


11*312 


45*020 


-2 0*254 


tz 


LEU 


0 


10*838 


43* 354 


-14.400 


42 


LEU 


CC 


11*430 


43*544 


-22*344 


42 


LEU 


coa 


12*339 


42*475 


-23.192 


• 3 


£LV 


CA 


t.133 


43*321 


-19*114 


• 3 


*LV 


0 


4*544 


41*922 


-21*024 


44 


VAL 


CA 


4*973 


39*907 


-19*919 


• 4 


VAL 


0 


4*424 


39.472 


-22.194 


• 4 


VAL 


CCI 


9*410 


37*477 


-2 9*557 


45 


ALA 


m 


9*134 


40.924 


-21*024 


S3 


ALA 


c 


4*713 


42.443 


*>22. in 


f s 


ALA 


CB 


2*t44 


40.443 


•21 .741 


44 


9t0 


CA 


3*413 


4%*43S 


* 9 .*> • # 


•4 


910 


0 


4.291 


44.405 


*>2 


• 4 


no 


CC 


7*130 


43.444 


-24* 544 


47 


SEt 


II 


3.541 


44.474 


-24.749 


• T 


SEt 


c 


1*103 


45*132 


-24.997 


• T 


set 


ct 


2*401 


44.777 


-24*927 


•1 


ALA 


m 


1*917 


44,34* 


-23*742 




ALA 


CA 


-4.213 


44*333 


-29*094 


•4 


4LA 


• 


-4.174 


44.711 


-22.43$ 


44 


Sit 


OS 


•4.144 


47*102 


-24.290 


49 


SEt 


CA 


-3*901 


44.947 


-22.227 


If 


Sit 


0 


-3.793 


44.944 


-20.209 


44 


LEU 


CA 


-2.374 


47*447 


-10.393 


94 


LEU 


0 


-3.942 


49*404 


-19.213 


44 


LtU 


CC 


-4.233 


47.951 


•17*174 


94 


LEU 


CD2 


1.140 


49*524 


-17*047 


tl 


tft 


CA 


-9.2SA 


44*474 


-14*137 



-34- 



72 


VAL 


a 


4*939 


42*997 


.i i * * * 


72 


VAL 


c 


4*312 


43*094 


*1 7 Alt 


72 


VAL 


ct 


2*914 


42*047 


• 1 A ■ ft C 
ft * . f 1 > 


77 


VAL 


CC2 


2*142 


42*327 


*»1 ft 7 5* 


71 


ALA 


CA 


4*947 


49.091 


*>1 4 1 1.T 
* » • ft • f 


73 


ALA 


0 


5*442 


47*184 


•21 21A 
ft* • ft 1 4 


74 


4L4 


II 


4.344 


44.429 


*1 ■ tic 
1 ■ • 4 J > 


74 


ALA 


C 


7*740 


47.449 


• 2 • VI » 


74 


ALA 


ct 


4.453 


47*444 


•1 2.12% 

• * * Tft 3 


75 


LEU 


C4 


7*412 


41*949 


*>2 ).ltl 
ft *• * 7 ft 


75 


LEU 


0 


14.142 


49.754 


•9} lit 
* * . ft > J 


75 


LEU 


cc 


4.123 


59* 913 


.)) 77* 


75 


LEU 


CD2 


3*994 


31*442 


—2 1 ftft« 

ft J. •>! > 


74 


ASM 


a02 


22*315 


44.432 

^ •• * 9 * 


*>7 ft 1 ft M 

ft ■ . J 


74 


ASH 


cc 


11.199 


44^. 7lA 
* •* * i • 




74 


ASH 


CA 


19*359 


* * 9 w 9 9 


* 7 l ftti 
— ft *• VJ4 


74 


ASH 


o 


39*157 


•J » * •> 9 7 


b 9 1 11* 

ft 4*41 f 


77 


ASH 


CA 


12.220 


CI • <7 

7 ■ . 7 > • 


— 21 .481 


71 




n 

V 


I*. 


47*779 


-25.113 


77 


ASK 


CC 


. * • (IV 


4 > fl 77 
» ft » »* f 


_« * i*i 
—21.414 


T7 






1 • « 7*1 


• 7 In 
»* * 741 


-23.925 


71 


* P A 


ft 


• 9 J 9 


92*414 


-24.904 


y* 


* C"* 


n 

V 




s % ■ ** 
91*971 


-2 1.144 


f • 




WW 


«f ■ 7 * 
*>• 111 


53.870 


-2A.494 




Tl 4* 


w** 


1 < 1 IE 


91 * » A a. 

»<• 794 


-21*120 


77 


Tl P 


0 


1 V AA V 


■ 1 in 
9*. 941 


—2 9. 479 


7* 




CC* 
w» • 


Aft* 7*J 7 


2>4* f Ji 


-29. 814 


77 


XLE 


C01 


12* 1 35 


AC t Tl 
97 * i ft) 


»9 *■ III 

—* 8* 155 


40 


6LY 


C4 


34*474 


» » • ▼*>» 


^4 9 A i 9 

— * 7* 7 J 3 


40 


CLT 


o 


15*719 


41 .*>%4 


*>1 • tiftft. 


01 


VAL 


CA 


3 3*411 




_ • . S.1 « 

— ft V . VI 1 


IX 


VAL 


o 


32* 240 


42. Tl* 


»9fl lit 
— ft 9 • 1 1 T 


01 


VAL 


CCI 


14*930 


47.444 


*>1 1 .471 

17*11 J 


42 


LEU 


H 


12*124 


45.445 


ft ▼ . ft ft V 


42 


LCU 


C 


19* 990 


44*928 


*>1 1. 1 1 A 

ft- ■ • 9*9 


42 


LCU 


C9 


12*204 


44*219 


—21.229 


92 


LEU 


CD1 


10*794 


44*457 


-23*223 


93 


CLT 


H 


9.131 


44.190 


—19*914 


43 


CL1 


C . 


9.927 


42.41 1 


—19.825 

ft »• 9 


44 


VAL 


H 


7*272 


41*112 


-19*293 


44 


VAL 


c 


4*144 


48*830 


*»2l lftfl 

ft 1*19 0 


44 


VAL 


C9 


4*254 


98*920 


*>1 A Alt 
1 • » • *1 


44 


VAL 


CC2 


7.190 


38*507 


—IT 7A C 


45 


ALA 


CA 


4*217 


41 .144 


*>27 tct 
— ft- « • ft 9 9 


95 


ALA 


o 


9*240 


41.401 

^ 9m *> W ft 


*>7 7 111 
— ft*. V 99 


G4 


POO 


m 


9*240 


41.1 AA 


— ft J* V7ff 


44 


MO 


c 


4*321 


41 171 
• #• J * ft 


—9 9 li 9 

— ft J* V47 


44 


p to 


(• 


V* *ftft 


it 9*i 


-23*91 3 


94 


490 


CO 


4*977 


42.440 


•21.111 
ft 


47 


SEt 


CA 


2*409 


49*324 


-23.529 


47 


SEt 


e 


9.142 


45.313 


-29.A19 


47 


SEt 


OS 


9*991 


43.143 


*27.543 


09 


ALA 


C9 


-9*143 


43.314 


-21.929 


89 


ALA 


c 


-4.499 


45*717 


-22*490 


99 


SEt 


M 


-2*219 


43*491 


-22*478 


49 


SEt 


C9 


-4*343 


44.94) 


-22*999 


49 


SEt 


C 


-3.134 


44*740 


-29.727 


90 


LEU 


M 


-2*444 


47.454 


-20.037 


90 


LEU 


c 


-3*493 


48.438 


-17.944 


90 


LEU 


C9 


-4.951 


48.273 


-19*474 


99 


LfU 


C01 


•4*924 


44*341 


-17.219 


91 


71« 


Af 


•4*244 


47.944 


-14.938 


91 


wt 


c 


•4*973 


49.790 


-14.485 



0251 446 



-35- 



91 Tit i 

VI 778 CO 

91 Tft CD2 

91 Tfl CC2 

91 tfl ft* 

92 UA CA 

92 ALA 0 

93 »al a 

93 »AL C 
4) VAL CO 
9) VAL CC2 
14 ITS CA 

94 LTS 0 
♦4 ITS CC 

94 LTS CE 
9$ VAL « 

95 VAL C 
9S VAL Ct 
9$ VAL CC2 
9ft LEU CA 
9* LEU 0 
94 LEU CC 

LEU C02 
6LT CA 
CLT 0 

ALA ca 
ALA C 
ASP « 
ASP 001 
ASP CO 
ASP C 
CLT ■ 
CLT C 

sea a 

SEE C 
SEE CO 
*lt a 

CLT C 
CLM B 

CLa C 

CLK CI 
CLa CO 
CLa aE2 

TTt CA 
TTI 0 
TTI CC 

114 TTI C02 
104 TTI CE2 

104 TTS On 
10$ SEI CA 

105 SEt « 

115 SEI DC 
104 TIP CA 
104 TIP 0 
10ft TIP CC 
lift TIP C02 
10ft TIP CE2 
10ft TIP C22 
104 TIP CM 2 
111 HE CA 
10T 2LE 0 
10T SLE Ctl 
10T Uf COl 



94 
91 
97 
91 
91 
99 
99 
99 
99 
160 
100 
101 
101 
101 
102 
102 
101 
101 
103 
101 
101 
104 
104 
104 



-4.49ft 
-7.094 
-T.9T1 

-••us 

-0*112 
-4.949 
-4.723 

-4.701 
-7.957 
-0.195 
-4.371 
.-■•450 
-5.394 
-4.399 
-4.109 
-4.919 
-0.104 
-4.900 
-4.712 
—3.942 
-3.513 
-4.409 
-3.090 
-1.419 
-0.421 
0.101 
-0.504 
-2. T30 
-0.440 
0.14ft 
-0.424 
-1.520 
-2.542 
-4.759 
-1.714 
-5.021 
-0.144 
-9.377 
-10.943 
-11.471 
-12.340 
-13.419 
-12.040 
-12.939 
-11.429 
-10.379 
-9.352 
-0.401 
-14.077 
-14.750 
-15.209 
•12.421 
-12.021 
-U.ft43 
-10.450 
-11.339 
-10.471 
-9.293 
-10.145 
-11.095 
-0.014 
-0.2*3 



47.749 

40.2)7 

49.275 

49.421 

40.752 

00.199 

00.990 

40.993 

40.014 

47.055 

47.370 

50.404 

00.400 

02.320 

04.201 

49.071 

40.499 

47.030 

44.100 

49.103 

01.121 

44.799 

44.002 

52.307 

01.443 

05.470 

03.110 

02.573 

00.902 

01.403 

50.145 

49.113 

47.431 

40.120 

47.094 

47.447 

47.092 

44.534 

47.050 

45.232 

47.307 

49.104 

49.197 

43.124 

43.274 

40.029 

40.959 

40.037 

00.191 

45.144 

45.935 

47.039 

47.391 

44.440 

40.111 

49.012 

00.073 

01.310 

01.291 

44.250 

49.474 

43.704 

42.990 



-14.073 
-17.741 
-10.149 
-19.492 
-21.744 
-12.707 
-12.050 
-11.129 
-0.099 
-10.4)1 
-12.072 
-4.999 
-5.713 
-5.447 
-4.199 
-5.02ft 
-2.541 
-4.319 
-4.132 
-1.404 
-2.334 
-2.072 
-1.045 
0.207 
0.145 
- 1.S10 
1.917 
2.912 
4.093 
5.175 
3.320 
2.140 
2.002 
2.900 
2.532 
4.017 
2.577 
2.520 
2.400 
2.022 
3.274 
4.915 
4.112 
1.501 
-0.007 
2.472 
1.040 
2.171 
3.324 
-0.034 
-2.250 
1.450 
-1.940 
-4.2*5 
-0.204 
0.511 
1.541 
2.500 
2.455 
-3.323 
-5.390 
-1.9?ft 
-0.427 



91 


7TB CO 


—4. 404 


48 .993 


-16.314 


91 


TTH CDl 


• -4.395 


47.41 5 


-10.735 


91 


iti ret 
III 4tl 


—4.993 


47. 172 


-2 8* 898 


91 


TTI LI 


»T Y%4 


4ft 11} 


—20.441 


92 


ALA a 


«A ft**. 
— • . • T * 


49. 958 


—14.104 


92 




—5.823 


50.033 


-1 1 .90 3 


92 


ALA V» 


— J . TTI 


9mm OA 1 


*>t y ift > 
— J A . * • • 


93 


■ At * m. 


_ *. 1 ft 1 


41 ftV A 


— 4 w . J A » 


03 


WAL 0 


_ * « ft 1 
— O. IB* 


4T 44 Y 


— 9 . J f A 


93 


MAI fE1 

VAL L»l 


— V. 2 I 3 


11 Aft ft 


—V. 14) 


*4 


LTS a 


— ft.VOT 


ft ft * * f 

30 .21 T 


— 8 . J 2 7 


94 


LTS C 


—7.331 


49. 90 S 


—5*894 


94 


LTS CO 


—4 .051 


51*974 


-4.81 8 


94 


LTS CD 


—4.041 


53. 70S 


-5. 582 


94 


LTS 82 


-3.733 


35.544 


-4* 387 


93 


VAL CA 


—7.44ft 


48 .457 


—3*920 


05 


VAL 0 


-7.42 5 


48 • 154 


—1.501 


95 


VAL CCl 


—0.841 


44.052 


-5*419 


Oft 


LEU 9 


-3.474 


48.974 


-2.404 


Oft 


LEU C 


—4. 331 


59 .559 


—1. 32 1 


94 


LEU CO 


-3. 509 


48.241 


—1.573 


94 


LEU CDl 


-2.207 


44.184 


—2.143 


97 


CLT a 


-4.324 


39 .975 


-0.084 


97 


CLT C 


-2.343 


52.437 


0. 38 5 


90 


ALA a 


—1.154 


53.448 


0. 750 


91 


ALA CA 


-0.543 


54.048 


0.945 


90 


ALA 0 


1. 393 


52.921 


1*44 3 


99 


ASP D02 


-2*431 


01 .042 


6.151 


99 


ASP CC 


-2.003 


01 .131 


5. 040 


99 


ASP CA 


0. HI 


51.410 


3. 055 


99 


ASP 0 


0.735 


49.313 


A ft * ■ 


100 


CLT CA 


—8. 343 


A • til 

48.521 


« ill 


100 


CLT 0 


-1. &49 


44. 512 


1.479 


19! 


SEI CA 


—3.342 


47 . 501 


3. 9 1 9 


101 


SEE 0 


Ill 


*» « T • 2 


1 . T 9 1 


101 


SEI OC 


—4.411 


41. 434 


K » ft ■ 


102 


CLT CA 


— 7. 07 7 


47 .422 


V ft • A 

1. IT* 


102 


OLT D 


— 1.901 


AC Alt 

45.4 31 


3. «3 v 


103 


fell LI 


. . tic 
—10. 335 


4ft .ITT 


T ft 9 S 
J* Pa O 


103 


CLa 


—1 0.779 


4 5.482 


6.117 


103 


CLa CC 


—11.348 


48.005 


A. ft ft A 


103 


CLa OEl 


. . ||4 

-12.159 


A A 111 

49.814 


ft ft ft 9 

5. T 92 


104 


TTt a 


-11.411 


44.141 


2.451 


104 


TTI C 


-13.031 


43.490 


0*473 


104 


TTI CO 


-12.497 


41.844 


2.143 


104 


TTI CDl 


-21.019 


39.789 


3.377 


104 


TTI " CE 1 


-10.009 


38.985 


3.707 


104 


TTI C2 


-9.544 


39.022 


3.001 


105 


SEI a 


-13.909 


44.572 


0.903 


105 


Sfi C 


-14.172 


45*920 


-1.139 


105 


SEI CO 


-15. 800 


44.121 


0*403 


10ft 


tip a 


-13.079 


44.425 


-0.834 


10ft 


TIP C 


-11.999 


44.434 


-3.012 


10ft 


TIP CO 


-11.321 


40.254 


-1.335 


10ft 


TIP COl 


-12.042 


49.524 


9.244 


104 


TIP BE 1 


-12.491 


00.350 


1.340 


104 


TIP CEO 


-9.275 


49.952 


0.374 


10ft 


TIP C23 


-4.440 


00.541 


1.525 


107 


ILE a 


-11.339 


43.330 


-2.401 


107 


1LE C 


-11.333 


43.394 


-4.190 


107 


SLE CO 


-9.944 


43.19/3 


-2.523 


107 


2LE CC2 


-9.432 


42.930 


-3.301 


109 


IL! a 


-12.994 


43.292 


-3.377 



-36- 



0251 44 



in 
100 

10* 
188 
119 
109 
109 
1*9 
110 
110 

til 
111 
111 
111 

112 

112 

112 

112 

113 

113 

113 

113 

113 

113 

113 

114 

114 

114 

115 

115 

115 

115 

11* 

11* 

117 

117 

117 

117 

lift 

lift 

11* 

lift 

119 

119 

119 

119 

120 

120 

120 

120 

121 

121 

121 

121 

122 

122 

122 

122 

223 

12) 

123 

121 

124 



IH CA 
lit 0 
ILE CC1 
lit C01 

asm c* 

ASM a 

ASM CC 

ASM B02 

tit C* 

(iv e 

ILE C* 

ILC • 

ILC CC1 
III C01 
tLU C* 
tLU 0 
tLU CC 
till DEI 

tip m 

tip c 

TIP Cl 
TIP C01 
tIP ME1 
Tl> CE3 
TIP C23 
ALA H 
ALA C 
ALA CO 
ILC CA 

ilc o 

ILE Ctl 
2LE C01 
ALA CA 
ALA 0 
ASM m 
ASM C 
ASM CO 
ASM 0D1 
ASM M 
ASM C 
ASM CO 
ASM 001 
MIT M 
MIT C 
MET CO 
MET SD 
ASP M 
ASP C 
ASP CO 
ASP 001 
VAL M 
VAL C 
f AL CO 
9 At CfcZ 
ILC CA 

Hi e 

ILE CC1 
ILC C01 
ASM CA 
ASM 0 
ASM CC 
ASM M02 
MET CA 



-14.11* 
-14.194 

-15.432 
-15.204 
-14.440 
-1*.S20 
-14.433 
-11.952 
-11.929 
-12.403 
-13.921 
-11.421 
-11.510 

-u.ni 

-10.4*7 
-17.047 
-19.041 
-25.094 
-14.07* 
-13.012 
-14.14* 
-13.597 
-11.451 
-10.410 
-13.019 
-13.199 
-11.299 
-15.070 
-1*.077 
-15.210 
-14.004 
-17.390 
-17.323 
-13.423 
-13.427 
-13.415 
-14.5*5 
-14.223 
-12.240 
-14.247 
-14.510 
-11.414 
-10.02S 
-9.010 
-0.700 
-0.904 
-7.022 
-7.355 
-7.001 
-7.021 
-0.29* 
-4.755 
-4.707 
-ft.245 
-4.029 
-0.*** 
-9.97* 
-J. 145 
-3.T00 
-0.492 
-0.34* 
-3**50 



42.722 
43.320 
41.077 
40.0*5 
44.010 
4A.2T2 
47.404 
40.447 
45.917 
44.929 
42.334 
42.304 
40.501 
39.70* 
43.37* 
44.130 
42.917 
40.0** 
45.403 
45.6*3 
47.553 
49.73* 
50.443 
47.445 
47.199 
44.101 
-43.179 
43.192 
41.440 
42.225 
39.03* 
39.411 
44.440 
45.255 
45.390 
44.97* 
44.950 
49.012 
43.725 

42.44* 

42.0*3 

42.321 

42.500 

40.734 

42.4*1 

44.943 

40.437 

3d. 390 

39.154 

40.70* 

39.117 

39.534 

30.507 

37.914 

39.799 

30.012 

40.392 

39.700 

09.054 

41.431 

40.040 

40.747 

39.973 



-4.321 
-4.552 
. -2.402 
-1.131 
-5.91* 
-0.235 
-♦.333 
-3.442 
-7.0*5 
-10.934 
-9.099 
-11.140 
-7.455 
-*.33* 
-19.04* 
-12.24* 
-0.135 
-0.01* 
-10.971 
-13.140 
-11.434 
-12.491 
-13.723 
-13.809 
-14.179 
-12.132 
-14.752 
-13.1*0 
-14.097 
-17.070 
-13.043 
-11.743 
-14.050 
-10.343 
-17.122 
-19.034 
-17.424 
-17.773 
-10.9*7 
-19.143 
-23.279 
-20.759 
-10.475 
-10.921 
-17.055 
-17.52* 
-19.504 
-20.05* 
•21.23* 
-23.004 
-20.115 
-15.70* 
-17.49* 
-10.04* 
-13.397 
-12.4*9 
-13*0*3 
-12.393 
-11.232 
-9.033 
-20.777 
-9.720 
-7.430 



too 

100 
109 
109 
109 
209 
109 
130 
110 
111 
121 
111 
111 
112 
112 
112 
112 
112 
113 
113 
113 
113 
113 
113 
113 
114 
114 
115 
135 
115 
115 
11* 
114 
11* 
117 
117 
117 
117 
110 
110 
110 
119 
119 
119 
129 
119 
120. 
220 
120 
120 
121 
121 
121 
222 
222 
122 
122 
123 
223 
223 
123 
12* 
124 



ILE C 
2LE CO 
3Lf CC2 
ASM M 
ASM C 
ASM CO 
ASM 031 
6LT a 
6LT C 
1L! a 
ILI C 
ILE CO 
ILE CC2 
ClU M 
SLO C 
tLU CO 
tLU CO 
tLU Of 2 
70P CA 
IIP 0 
TIP CC 
TIP C02 
TIP CE2 
TIP CZ2 
TIP CM2 
ALA CA 
ALA 0 
ILE M 
ILE C 
ILE CO 
ILE CC2 
ALA N 
ALA C 
ALA CO 
ASM CA 
ASM 0 
ASM C6 
ASM M02 
ASM CA 
ASM 0 
ASM Ct 
ASM M02 
MET CA 
MET 0 
MET Ct 
MET Cf 
ASP CA 
ASP 0 
ASP Ct 
*S» 002 
9AL CA 
9AL 0 
VAL CC1 
ILE 5 
ILE C 
ILE CO 
UE Ct2 



ASM 
ASM 



ASM CO 
ASM 051 
MET M 
M5T C 



-14.A39 
-15.24* 
-14.5*0 
-14.7*1 
-14.232 
-13.280 
-17.455 
-12.951 
-12.100 
-12.379 
-13.059 
-12.714 
-13.122 
-14.893 
-15.072 
-17.229 
-18.724 
-19.123 
-14.75* 
-14.319 
-13.40* 
.-12.441 
-12.545 
-11.494 
-10.752 
-12.333 
-12.9*3 
-14.174 
-13.921 
-14.000 
-17.151 
-16.534 
-14.744 
-10.011 
-14.553 
-12.997 
-14.400 
-14.931 
-13.7*0 
-11.417 
-13.737 
-24.13* 
•20.232 
-10.010 
-9.080 
-9.902 
-0.400 
-0.030 
* -0.237 
-9.327 
-4.22* 
-4.204 
-3.730 
-4.310 
-5.020 
-7.474 
-7.221 
-4.2*3 
-3.502 
-1.020 
-0.0*3 
-3.430 
-2.423 



43.494 

42.2*3 

42.024 

44.958 

44.047 

47.359 

44.495 

45.900 

44.712 

43.539 

42.3*0 

40.940 

39.791 

43.075 

44.347 

43.099 

41.024 

41.921 

44.400 

45.932 

40.554 

40.552 

49.7*1 

90.045 

49.074 

44.0*5 

43.074 

42.340 

42.405 

40.040 

40.148 

43.527 

45.049 

45*510 

45.947 

45.43* 

40.177 

40.249 

42.4*2 

42.309 

43.040 

44.094 

42.222 

39.038 

43.083 

44.0*1 

39.110 

37.289 

39.730 

39.133 

39.491 

40.700 

30.17* 

30*970 

39.2*2 

39.A04 

39.083 

40.222 

48.404 

40.470 

30.990 

39.404 

39.409 



-3.384 
-3.328 
-4.093 
-4.981 
-7.88* 
-3.207 
-4.44* 
-*.774 
-8.812 
-8.244 
-9.942 
-9.3*4 
-9.347 
-9.280 
-11.171 
-9.141 
•0.415 
-9.04* 
-12.900 
-14.332 
•12.481 
-13.463 
-14.215 
-15.274 
-IS. 403 
-13.874 
-15.971 
-14.119 
-15.854 
-13.922 
-14.735 
-15.247 
-17.271 
-15.151 
-18.139 
-19.820 
-14.939 
-15.73* 
-19.832 
-20.932 
-21.395 
-22.133 
-11.479 
-18.759 
-14.392 
-18.243 
-20.330 
-10.490 
-22.434 
-22.739 
-14.974 
-15.909 
-14.42? 
-14.590 
-12.427 
-12.44* 
-10.954 
-12.110 
-9.8*1 
-11.497 
-11.019 
-0.832 
-4.414 
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ii* 
it* 
it* 
in 
in 
in 
it* 
u« 
in 
it* 
in 
in 

iti 

in 

in 

in 

in 

in 

no 

no 

in 

in 

in 

nt 

nt 

in 

tn 

in 

11« 

II* 

111 

IIS 

111 

111 

II* 

II* 

11* 
II* 
lift 
1M 
111 
111 
111 
111 
II* 

119 
111 
1*0 
1*0 
1*0 
1*4 
1*1 
1*1 
1*1 
1*1 
1*1 
1*1 
1*1 
Iftl 
1*1 
1*1 
1*1 
Iftft 



•if e 
■it tc 

• |T CI 

si* u 
in o 
in •* 

LtU u 
LIU 0 

liu tt 

LIU C01 
OH C* 

on e 

«H C* 
tLf 0 

rtc c» 
no o 

MO Ct 

III « 

on c 

ill Cft 
OH ft 



Si' c 

IM ft 
in c 

III Cft 

ALft II 

ALft C 
ALft ct 
ALA Cft 
*k* 
LfU 
LIU 



0 

c 



Liu Cft 
LtU C01 
Ltl ft 
Lll C 
HI Cft 
Ltl CO 
Ltl Nt 
ftLl C* 
*Ll 0 
ftLl ft 
ftLl C 
ALI Cft 
VftL C* 
fftL 0 
ffftL C61 
ftl» • 
Aft* C 
AH Cft 
A|» 001 
Ltl • 
Ltl C 
itl Cft 
HI CO 
Ltl lit 
AH Cft 
ftlft 0 
VAL ft 
VAl C 
VftL Cft 
fftL CO! 
Al* CA 



•2.10A 
-ft. Ill 
•f.1%0 

in 

0. tn 

1. *** 

•i.«*t 
•j.i** 
-o.ofti 
•ft.if* 

-1*011 

-J.A4ft 
-4.*11 
-**9ftl 
-4.111 
-ft. 1)1 

•••419 
-1.011 

in 

-9 .0*9 
•10.011 

-lt.tes 

•11.0*0 

•n. no 

.14.110 

-u.i*i 
•n.ftie 
-ii.iftft 

•It.fttl 
-14.111 
-2!.*7» 
-1**111 
•11.011 
•ll.**0 
-1*.909 
•ift.Oft* 
-14.001 
•11.011 
•11.101 
-17.711 
-It.fOl 

-ift.021 

-14.101 
•lft.lt! 
-11.1** 
•11.101 
•10. lit 
-14. in 
-14.011 
•lft.1** 
•l*.lt» 
-Ift.ftlft 
-14.171 
•1ft. 010 
•I*. ftlft 
•11.111 

•i4.ni 
•n. no 
•ii.sii 
•i*.i*ft 

-It. on 

-11.101 

•ift.n* 



11.101 
40.012 
1ft. Oil 
ftft.tlt 
ftl.ftlt 
*0.**ft 

40.1*1 

Sft.llft 

41.4*1 

At.tftO 
Sf.fttl 
00.010 
01**9ft 
Oft. lift 
14.121 

it. in 
o*.n* 

n. en 

J4.ll* 

ii.in 
n.iftt 

04.111 

11.011 

04.101 

Oft. on 

04. Iftft 
04.141 
Ol.ltft 

17. tit" 
17.115 

it. nt 

Oft. 001 
17.121 
11.411 
04.121 
01. tit 
02.1*1 
21.112 
tft.411 

04.414 

11.0*9 
14.101 
14.414 

01. nt 

oi. m 

14.070 ' 
11.114 
01.11* 
01.111 
01.1*9 
10.401 
04.1*1 
ll.*tl 

ift.m 

Oft.Ut 
40.017 

Oft. tot 

Oft.lftt 
01. in 
01.191 
91**79 
02.191 
01*09* 



-ft. 019 
•t.*7l 
•t.O*l 

•o.io 

-1.O01 
•1.01! 
•1.11* 
-2. lit 

•1.111 
•ft. 019 
ft. HI 
1.220 
9.4*2 
1.27* 
l.ttl 
A.10I 
t.ltl 
ft. tit 
4.1tft 
1.11* 
4.1*9 
1,1*2 
2. It* 
1.914 
0.1*1 
1.19* 
0.091 
1.99* 
•O.Tftt 
•1.0*9 
-1.0*4 
-2.7CI 
-0.111 
-2.292 
•2*119 

•4.110 

•2.11* 
•0.10* 
•4.1*0 
-4. Ml 
•9.201 
•1.121 
•1.117 
•1.914 
•1.117 
- •1.H7 
-7.144 
•9. lit 
•10.01* 
-I. Ill 
•9.201 
-1.120 
-tl*9*» 

-10. in 

•10.194 
-10.171 
•ll.ftl* 
•14.111 

-11.112 
•14.494 

*12*11* 
•11.01* 
•1*.**1 



It* 
It* 
Itl 
111 
111 
lift 
tt* 
It* 
It* 
ltl 
121 
111 
III 
111 
129 
129 
119 
110 
110 
110 

111 
111 
lit 
lit 
lit 

119 
111 
11* 
11* 
11* 
111 
111 
111 
111 
114 
111 
. 11* 
11* 
111 
197 
ill 
111 
111 
119 
119 
119 
199 
140 
1*0 
l*t 
1*0 
1*1 
1*1 
1*1 
1*1 
1*2 
1*2 
1*1 
1*9 
1*1 
1*9 
l«* 
I** 



■ 17 CI 
ait 1? 
Alt « 
II* C 

on c« 

LfU ft 
LfU C 
LtU CI 
LIU C01 
Olt * 

Ott C 
ftlt ft 
• Lt C 
010 ft 
MO C 
MO CI 
Old CO 
11* CI 



9 

oc 

Cft 

0 



in 
on 

Olf 

on 

in ci 
on o 
on so 

AH CA 
ALI 0 
At* M 
ALft C 
Akft CI 
LtU Cft 
LIU 0 
LtU CC 
LtU COt 
Ltl CA 
111 C 
Ltl CC 
LY1 CI 
AH M 
ALI C 
ALft CO 
All Cft 

ftift e 

VAl ft 
VAL C 
VAl CO 
VAL C*l 
ll» CA 
01* 0 
Alt CO 
Al' Otl 
ltl Cft 
Ltl 0 

wit cc 

HI CI 

ftlft ft 

ALA C 
ALA Cft 
VAL CA 
VAL 0 

VAL CO! 
Alft M 
ALA C 



-4.1*1 
-1.III 
-l.*l* 
•ft.*tt 

1.121 

-1.411 

-2**ll 
•2.191 
•1.21ft 
•2.121 
-0.11* 
-4.121 

-4. ft** 

-4.91* 

-ft.llft 

•4.040 

-4.219 

-ft. *70 

-••9*1 

-ft. 721 
-10.02* 
•12.*1I 
-1*.*01 
-14.199 
-14.411 
-17.107 
-11.141 

-ii. ni 

-14.411 
-11.2*1 
-14.191 
•11.194 
-11.009 
-10.112 
•14.1*1 
-11.119 

-14.1*1 
-11.741 
-1ft. 7** 
•17.111 
•11.01* 
•1ft. 001 

•i*.m 

•11.910 
-11. All 

-n.ftio 

•11.070 

-11.11* 

-14.010 
-19.111 
-lft.199 
•11.919 
-14.700 
-11.11* 
•10.17} 
-11.147 
-11.111 
•11*170 



•11*1*0 

•1*.1*0 
•10.1*0 
•11.111 

•1ft. 920 



09.111 
09.*7| 
41.41* 

40.111 
41.027 
40.071 
19.194 
41.0ft! 
41.111 
19.112 
OI.UO 
01.**1 

o*. on 

19.117 
0*.IOft 
14.41* 
00.170 

o*.*n 

01.111 
14.424 
14*121 
04. lit 
01**!! 
94.1U 
01.199 
14.017 
9*. 417 
94.211 
07.0*9 
01**00 
91.1** 
14.120 
01.110 
14.007 
11.997 
11.411 
12.041 
lft.10! 
04.240 
Oft* 101 

|4.141 

07.111 
94.1*1 
01.019 
04. ltl 
14.471 
01*700 
01*491 
11.171 
10.040 
00. lit 
11.004 
01.241 
07.014 
09.011 
01*041 
01.010 
04.417 

ot*toi 

11.014 
01*970 
It. HI 
It. HI 



-4.110 
-ft.410 
-4.102 
-4. HI 
•4.121 
-1.111 
•1.101 
•1.410 
-1.17ft 
•ft. 411 
1.412 
2.222 
4.114 
9.402 

A. 012 
1.11* 
0.411 
4.02) 
4.929 

B. 4D9 
0.914 
4.111 

0.011 
ft. 12* 

1.171 
1.12* 
-1.014 
0.294 
-1.174 
-0.117 
-1.10* 
•1.190 
-1.191 
-1.119 
-1.011 
-1.101 

-1.14ft 

-2.71ft 
-1.1*1 

•ft.»4| 

-4.2ft! 
•ft. 411 
•0.142 
-1.021 
-0.120 
-ft. 1*1 
•4. til 

•i. in 

-11.190 
•1.11* 
-4.121 
-lft. Iftft 
•11.111 
-11.10ft 
•11.110 
-11.1ft* 
-11.121 
•11.941 
•ll.ftIO 
•ll.ftlt 
-11.4ft! 
•10.011 
•ll.lftl 
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144 
I4| 
141 
14$ 
14* 
144 
14T 
14T 
MT 
I4T 
14| 
I4f 
S4f 
14* 
14* 
I4t 
14* 
110 
S00 
IIP 
111 
III 
SIS 

III 

SIS 

111 
111 
III 

114 
114 

III 
III 
III 
III 
lift 
lift 
lift 

Hi 
117 
III 
111 
III 
lift 
III 
IM 
19* 
lit 
Iftl 
SftO 
Iftl 
1*1 
US 

Iftl 
Iftl 
Iftl 
Iftl 

141 
Iftl 
IM 
1ft* 
166 
Iftl 
Iftl 



All C 
lit ft 
lit c 
III CI 
»LT ft 
•IT C 
Vftl 
V41 



Vftl CI 
»il Ctl 
Vftl CI 
fftl 0 
tftl Ctl 



fftl Cft 
fAW cti 

m ci 

Vftl 0 
Vftl CCS 
Alft tt 
AH C 
All CI 
AH Cft 
lift 0 
HI ft 
All C 
All Cft 
«11 Cft 

•it o 

ASK Cft 
A|K 0 
AI« Cft 
ASK KD2 
•lU Cft 
•lU 0 
ALU CC 
ClU Oil 
•if M 

•l» c 
tm ■ 

TM 001 
Tm Cft 

TM 0 

sit ee 
in Cft 

III 6 
•if Cft 
Sit o 

lift Cft 
lit 0 
Alt DC 
lit Ci 
lit o 

lit ec 

lift Cft 
lift 0 

lit St 

TM Cft 
TM 0 
T*i fiftl 
Vftl II 
fftl c 



•it. lie ii.ioi 

-14.107 21.9*9 

-IS. 409 14. TT] 

-IT. 010 H.STft 

•14.977 mm 

•11.179 14.4*1 

•u.uo is. ifti 

-'••ID I4.|»4 

•nasi ift.tTT 

-11.1*0 IT. lift 

-T.40I I4.no 

-4.S4C Ift.Hl 

•I.DTt 11.4ft! 

-»•*•• 14.)$! 

•9.700 14.11ft 

•ft. 114 I4.M0 

-f.414 II. lift 

•••111 14.9IT 

-••MI 1ft. TT» 

•••VTJ 14.4)1 

•1.1*1 14,94* 

•1.000 11.014 

•••SIT 11.190 

••TI4 ll.4Sft 

•I.TIft l4.4ftft 

• •HI 11.102 

•■••I 11. Til 

I-TIO 11.011 

1.041 14.111 

A.IM 11-147 

•••44 14. TIT 

*.J0X 14.IM 

••••0 14. T0J 

*•*»* 17.111 

*♦•» 11.117 

1.174 10.4)7 

1*401 11.442 

1.T44 14.122 

••••f 11. 017 

A.S01 11.412 

I.HT 1T.TI1 

A. TOT 21.417 

••111 24.417 

••*T* IT.lll 

1*141 11.004 

♦.III 11.110 

••SIT 11.211 

••••4 11,104 

4.»0I 11.92ft 

••••4 10.T77 

A.AIft I0.S4T 

• •••4 11.021 

••1*7 12.721 

••111 12.140 

••••4 11. Oil 

•••All 14.TS0 

-1.071 |«.9M 

•1*102 11.711 

•••00 10.140 

1.401 10,192 

1*M4 lft.202 

•••III 11.742 

••••1ft 1ft. 149 



•14.011 
-11.70! 
•IT. 02* 
-1ft. 41* 
•IT. fftl 
••••••I 

-17.11* 
•1ft. Ill 
•11.119 
•Ift.210 
•10. 001 
•14.710 

•u.ioi 
-ii. in 
•n.ftii 
•n.iii 

•11.00ft 
•10.001 
•0.490 
•11*441 
•••III 
•ft. 417 
•ft.lOT 
•1.112 
•J. 447 
•1.012 
•S.lll 
-1.191 

• •US 
•••IIS 

1.037 
4.291 
0.100 

• •SSI 
4.OT0 

• •112 
A.Sftl 
1.112 
4.22T 
4.111 
1.112 
ft. 217 

• •TO! 
T.07I 

10. lit 

• ••II 
1.010 
•••OS 

ft. Ill 

7.01* 

• ••ft! 
ft. SIS 
T.U1 
1.104 
0.41C 
1.990 

1.104 

1.111 
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The above structural studies together with the kinetic 
data presented herein and elsewhere (Philipp, M. , et 
al. (1983) Biochem. 51, 5-32; Svendsen, 

1 B (1976) ^-rlsbera Res. Coiratu 41, 237-291; 

Markland, S.F.'ld; Stauffe, D.C., et al. (1965) is. 
mm. chem. 241, 5333-5338) indicate that the subsites 
in the binding" cleft of subtilisin are capable of 
interacting with substrate amino acid residues from 
P-4 to P-2'. 

The most extensively studied of the above residues are 
Glyl66, Glyl69 and Alal52. These amino acids were 
identified as residues within the S-l subsite. As 
seen in Fig. 3, which is a stereoview of the S-l 
subsite, Glyl66 and Glyl69 occupy positions at the 
bottom of the S-l subsite, whereas Alal52 occupies a 
position near the top of S-l, close to the catalytic 
Ser221. 

All 19 amino acid substitutions of Glyl66 and Glyl69 
have been made. As will be indicated in the examples 
which follow, the preferred replacement amino acids 
for Glyl66 and/or Glyl69 will depend on the specific 
amino acid occupying the P-l position of a given 
substrate . 

The only substitutions of Alal52 presently made and 
analyzed comprise the replacement of Alal52 with Gly 
and ser. The results of these substitutions on P-l 
specificity will be presented in the examples. 

in addition to those residues specifically associated 
with specificity for the P-l substrate amino acid, 
Tyrl04 has been identified as being involved with P-4 
specificity. substitutions at Phel89 and Tyr217, 



35 
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however, are expected to respectively effect P-2' 
p-1* specificity. 



and 



The catalytic activity of subtilisin has also been 
modified by ain 9l e amino acid substitutions at Asni, . 
5 The catalytic triad of subtilisin is shovm in Fig. 4. 
M can be seen, f«a», Kis64 and Asp 3 2 are positioned 
W facilitate nueleophilic attach by the serine 
hydoxylate on the carbonyl of the scissile peptide 
bLd crystallograrhic studies of subtilisin 

10 Xrtus *t *1 <»»> H- 

fctthews, et »A. l^SisL^M, 252. 7U0-71W, 

"euros. a«« aa, 

show that two hydrogen bonds are formed «J> -ne 
oxyanion of the- substrate transition state. one. 
15 hydrogen bo,d donor is fx. the catalytic ■«»»>» 
main-chain amide while the other is fro. one of .he 
1IE2 protons the asparagine-155 side chain. See 

Fig. 4. 

20 Asnl55 was .instituted ' with Ala, Asp, His, Clu and 

de to investigate the 
Thr. These substitutions were ma«« 

L stnbiUzation of the charged 
intermediate of the ■ transition state comple, = by J* 
potentiai hydrogen bond between the side chain of 
25 Lm» and the oxyanion of the intermediate. These 
particular substitutions caused large decreases in 
substrate turnover, ,cat (2 00 to 4.000 fold, margin* 
decreases in substrate binding Km (up to 7 fold, , and 
a ioss in transition state stabilization 
30 to 4.7 Xcal/mol. The retention of Km and the drop in 
Xcat will ma*, these mutant enzymes useful a. binding 
proteins for specific peptide seances the nature o 
which will be determined by the specificity of the 
precursor protease. 

35 
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Various other amino acid residues have been identified 
which affect alkaline stability. In SOJne cases 
mutants having altered alkaline stability also have 
altered thermal stability. 

5 In B amyloliquefaniPns subtilisin residues Asp36 
11.107, Lysl70, Ser204 and Lys213 have been identified 
as residues which upon substitution with a different 
ammo acid alter the alkaline stability of the mutated 

="> C ° mPared ^ PreCU " or e ^y-e. The 

substitution of As P 36 with Ala and the substitution of 
Lysl70 with Glu each resulted in a mutant enzyme 
having a lower alkaline stability as compared to the 
wild type subtilisin. When Ilel07 was substituted 
with vi # Ser204. substituted with Cys, Arg or Leu or 
Lys213 substituted with Arg, the mutant subtilisin had 
a greater alkaline stability as compared to the wild 
type subtilisin. However, the mutant Ser204P 



demonstrated a decrease in alkaline stability. 



20 



In addition, other residues, identified as being 
associated with the modification of other properties 
of subtilisin, also affect alkaline stability. These 
residues include Ser24, MetSO, Glul56, Glyise, Glyi69 

25 !!LT!| 7 ' SPe<=ifiCally tte followin * Particular 
substitutions result in an increased alkaline 

stability: Ser24C, MetSOF, Glyl56Q or S, Glyl66A, H 
K, » or Q, Glyl69S or A, and Tyr217F, K, R or L. The 
mutant MetSOV, on the other hand, results in a 

30 de ": e "r, ^ alkaline Btab ^ity of the mutant 

subtilisin as compared to wild type subtilisin. 

Other residues involved in alkaline stability based on 
the alkaline stability screen include Aspi97 and 
Met222. Particular mutants include Aspl97(R or A) and 
Met 222 (all other amino acids). 
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Various other residues have been identified as being 
involved in thermal stability as determined by the 
thermal stability screen herein. These residues 
include the above identified residues which effect 
alkaline stability and Metl99 and Tyr21. These latter 
two residues are also believed to be important for 
alkaline stability. Mutants at these residues include 
1199 and F21. 

The aiiino acid sequence of B. amvloliouef aciens 
substiliein has also been modified by substituting two 
or more amino acids of the wild-type sequence. six 
categories of multiply substituted mutant subtil is in 
have bean identified. The first two categories 
comprise thermally and oxidatively stable mutants. 
The next three other c&tegories comprise mutants which 
combine the useful properties of any of several single 
mutations of B. amvloliguef aciens subtilisin. The 
last category comprises mutants which have modified 
alkaline and/or thermal stability. 

The first category comprises double mutants in which 
two cysteine residues have been substituted at various 
amino acid residue positions within the subtilisin 
molecule. Formation of disulfide bridges between the 
two substituted cysteine residues results in mutant 
subtilir.ins with altered thermal stability and 
catalytic activity. These mutants include A21/C22/C87 
and C24/C87 which will be described in more detail in 
Example 11. 

The second categosry of multiple 6ubtilisin mutants 
comprise*; mutants which are stable in the presence of 
various oxidizing agents such as hydrogen peroxide or 
peracidc Examples 1 and 2 describe these mutants 
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which include F50/I124/Q222 , F50/I124 , F50/Q22 2 , 
F50/L124/Q222, I124/Q222 and L124/Q222. 

The third category of multiple subtilisin mutants 
comprises mutants with substitutions at position 222 
combined with various substitutions at positions 166 
or 169. These mutants, for example, combine the 
property of oxidative stability of the A222 mutation 
with the altered substrate specificity of the various 
166 or 169 . substitutions. Such multiple mutants 
include A166/A222, A166/C222, F166/C222, K166/A222, 
K166/C222, V166/A222 and V166/C222. The K166/A222 
mutant subtilisin,. for example, has a kcat/Xm ratio 
which is approximately two times greater than that of 
the single A222 mutant subtilisin when compared using 
a substrate with phenylalanine as the P-l amino acid. 
This category of multiple mutant is described in more 
detail in Example 12. 

The fourth category of multiple mutants combines 
substitutions at position 156 (Glu to Q or S) with the 
substitution of L^s at position 166. Either of these 
single mutations improve enzyme performance upon 
substrates with glutamate as the P-l amino acid. When 
these single mutations are combined, the resulting 
multiple enzyme mutants perform better than either 
precursor. See Example 9. 

The fifth category of multiple mutants contain the 
substitution of up to four amino acids of the B. 
amvloliouefaciens subtilisin sequence. These mutants 
have specific properties which are virtually identicle 
to the properties of the subtilisin from fi. 
licheniformis . The subtilisin from 1. licheniformis 
differs from B. amvlol iauef aciens . subtilisin at 87 out 
of 275 amino acids. The multiple mutant 
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F50/S156/A169/L217 vas found to hove similar substrate 
specificity and kinetics to the lichenif ormis enzyme. 
(See Example 13.) However, this is probably due to 
only three of the mutations (S156, A169 end L217) 
which are present in the substrate binding region of 
the enzyme. It is quite surprising that, by making 
only three changes out of the 87 different amino acids 
between the sequence of the two enzymes, the B. 
aroyloliguifaciens enzyme was converted into an enzyme 
with properties similar to £. lichenif ormis enzyme. 
Other enzymes in this series include 
F50/Q156/N166/L217 and F50/S156/L217 . 

The sixth category of multiple mutants includes the 
combination of substitutions at position 107 (lie to 
V) with the substitution of Lys at . position 213 with 
Arg, and the combination of substitutions of position 
204 (preferably Ser to C or L but also to all other 
amino acids) with the substituion of Lys at position 
213 with R. Other multiple mutants which have altered 
alkaline stability include Q156/K166, Q156/N166, 
S156/K1S6, S156/N166 (previously identified as- having 
altered substrate specificity) , and F50/S156/A169/L217 
(previously identified as a mutant of. B. 
amvloliguifaciens subtilisin having properties similar 
to subtilisin from B, licheniformis l . The mutant 
F50/V107/R213 was constructed based on the observed 
increase in alkaline stability for the single mutants 
F50, V107 and R213. It was determined that the 
V107/R213 mutant, had an increased alkaline stability 
as compared to the wild type subtilisin. In this 
particular mutant, the increased clkaline stability 
was the result of the cumulative stability of each of 
the individual mutations. Similarly, the mutant 
F50/V107/R213 had an even greater alkaline stability 
as compared to the V107/R213 mutant indicating that 
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the increase in the alkaline stability due to the F50 
mutation was also cumulative. 

Table IV summarizes the multiple mutants which have 
been made including those not mentioned above. 

5 * 

In addition, based in part on the above results, 
substitution at the following residues in subtilisin 
is expected to produce a multiple mutant having 
increased thermal and alkaline stability: Ser24, 

10 Met50, Ilel07, G1U156; Glyl66, Glyl69, Ser204, Lys213, 
Gly215, and Tyr217. 
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TABLE IV 



10 



15 



20 



25 



30 



Double Mutants 

C22/C87 

C24/C87 

V45/V48 

C49/C94 

C49/C95 

C50/C95 

C50/C110 

F50/I124 

F50/Q222 

I124/Q222 

Q156/D166 

Q156/K166 

Q156/N166 

S156/D166 

S156/K166 

S156/N166 

S156/A169 

A166/A222 

A166/C222 

F166/A222 

F166/C222 

K166/A222 

X166/C222 

V166/A222 

V166/C222 

A169/A222 

A169/A222 

A169/C222 

A21/C22 



Triple, Quadruple 
or Other Multiple 



F50/I124/Q222 

F50/L124/Q222 

F50/L124/A222 

A21/C22/C87 

F50/S156/N166/L217 

F50/Q156/N166/L217 

F50/S156/A169/L217 

F50/S156/L217 

F50/Q156/K166/L217 

F50/S156/K166/L217 

F50/Q156/K166/K217 

F5 0/S 15 6/X16 6/K2 17 

F50/V107/R213 

IS153/S156/A158/G159/S160/A161- 
164/I165/S166/A169/R170] 
L204/R213 

R213/204A, E, Q, D, N, G, K, 
V, R, T, P, I, M, F, Y, W 
or H 

V107/R213 



In addition to the above identified amino acid 
35 residues, other amino acid residues of subtilisin are 
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also considered to be important vith regard to 
substrate specificity. Mutation of each of these 
residues is expected to produce changes in the 
substrate specificity of subtilisin. ' Moreover, 
multiple nutations among these residues and among the 
previously identified residues are also expected to 
produce subtilisin mutants having novel substrate 
specificity. 

Particularly important residues are His67, Ilel07, 
Leul26 and Leul35. Mutation of His67 should alter the 
S-l' subsite, thereby altering the specificity of the 
mutant for the P-l' substrate residue'. Changes at 
this position could also affect the pH' activity 
profile of the mutant. This, residue was identified 
based on the inventor's substrate ; modeling, from 
product inhibitor complexes. 

Ilel07 is involved in P-4 binding.. Mutation at this 
position thus should alter specificity for the P-4 
substrate residue in addition to the observed effect 
on alkaline stability. Ilel07 was also identified by 
molecular modeling from product inhibitor complexes. 

The S-2 binding site includes the Leul26 residue. 
Modification at this position should therefore affect 
P-2 specificity. Moreover, this residue, is believed 
to be important to convert subtilisin to an amino 
peptidase. The pH activity profile should also be 
modified by appropriate substitution. These residues' 
were identified from inspection of the refined model, 
the three dimensional structure from modeling studies. 
A longer side chain is expected to preclude binding of 
any side chain at the S-2 subsite. Therefore,, binding 
would be restricted to subsites S-l, S-l? , ; S-2' ; ,. S-3' 
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and cleavage would be forced to occur after the amino 
terminal peptide. 

Leul35 is in the S-4 subsite and if mutated should 
alter substrate specificity for P-4 if mutated . This 
5 residue was identified by inspection of the 
three-dimensional structure and modeling based on the 
product inhibitor complex of F222. 

In addition to these sites, specific amino acid 
2o residues within the segments 97-103, 12 6-129 and 
213-215 are also believed to be important to substrate 
•binding. 



15 



Segments 97-103 and 126-129 form an antiparaliei beta 
sheet with the main chain of substrate residues P-4 
through P-2. Mutating residues in those regions 
should affect the substrate orientation through main 
chain (enzyme) - main chain (substrate) interactions, 
since the main chain of these substrate residues do 
2 0 not interact with these particular residues within the 
S-4 through S-2 subsites. 

Within the segment 97-103. Gly97 and Asp99 may be 
mutated to alter the position of residues 101-103 
25 within the segment. Changes at these sites must be 
compatible, however. . In B. amylolicruifaciens 
subtil isin Asp99 stabilizes a turn in the main chain 
tertiary folding that affects the direction of 
residues 101-103. B. licheniformis subtilisin Asp97, 
functions in an analogous manner. 
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In addition to Gly97 and Asp99, SerlOl interacts with 
Asp99 in g. amylicruefaciens subtilisin to stabilize 
the same main chain turn. Alterations at this residue 
should alter the 101-103 main chain direction. 
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Mutations at Glul03 are also expected to affect the 
101-103 main chain direction. 

The side chain of Glyl02 interacts with the substrate 
P-3 amino acid. Side chains of substituted amino 
5 acids thus are expected to significantly affect 
specificity for the P-3 substrate amino acids. 

All the amino acids vithin the 127-129 segment are 
considered important to substrate specificity. Gly 
10 127 is positioned such that its side chain interacts 
with the S-l and S-3 subsites. Altering this residue 
thus should alter the specificity for P-l and P-3 
residues of the substrate. 

The side chain of Glyl29 comprises a part of both the 
S-2 and S-4 subsites. Altered specificity for P-2 and 
p_4 therefore would be expected upon mutation. 
Moreover, such mutation may convert subtilisin into an 
amino peptidase for the same reasons substitutions of 
Leul26 would be expected to produce that result. 

The Prol29 residue is likely to restrict the 
conformational freedom of the sequence 126-133, 
residues which may play a major role in determining 
P-l specificity. Replacing Pro may introduce more 
flexibility thereby broadening the range of binding 
capabilities of such mutants. 

The side chain of Lys213 is located within the S-3 
30 subsite. All of the amino acids within the 213-215 
segment are also considered to be important to 
substrate specificity. Accordingly, altered P-3 
substrate specificity is expected upon mutation of 
this residue. 
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The Tyr214 residue does not interact with substrate 
but is positioned such that it could affect the 
conformation of the hair pin loop 204-217. 

Finally, mutation of the Gly215 residue should affect 
the S-3 f subsite, and thereby alter P-3 1 specif icity. 

In addition to the above substitutions of amino acids, 

the insertion or deletion of one or snore amino acids 

within the external loop comprising residues 152-172 

aay also affect specificity. This is because these 

residues may play a role in the "secondary contact 

region" described in the model of streptomyces 

subtilisin inhibitor complexed with subtilisin. 

Hirono, et al. (1984) J. Mol. Biol. 178 , 389-413. 
15 « 

Thermitase K has a deletion in this region, which 
eliminates several of these "secondary contact" 
residues. In particular, deletion of residues 161 
through 164 is expected to produce a mutant subtilisin 
having modified substrate specificity. In addition, a 
rearrangement in this area induced by the deletion 
should alter the position of many residues involved in 
substrate binding, predominantly at p-1. This, in 
turn, should affect overall activity against 
proteinaceous substrates. 
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The effect of deletion of residues 161 through 164 has 
been shown by comparing the activity of the wild type 
(WT) enzyme with a mutant enzyme containing this 
deletion as well as multiple substitutions (i.e., 
S153/S156/A158/G159/S160/A161-164/I165/S166/A169/ 
£170) . This produced the following results: 
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kcat Km kcat/Km 

WT 50 1.4xl0" 4 3.6x10 5 

Deletion mutant 8 5.0xl0 -6 1.6xl0 6 



The WT has a kcat 6 times greater than the deletion 
mutant but substrate binding is 28 fold tighter by the 
' deletion mutant. The overall efficiency of the 
10 deletion mutant is thus 4.4 times higher than the WT 
enzyme. 

All of these above identified residues which have yet 
to be substituted, deleted or inserted into are 
15 presented in Table VI. 
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TABLE VI 
Substitution/Insertion/Deletion 
Residues 



His67 

Leul26 

Leul35 

Gly97 

Asp 9 9 

SerlOl 

Glyl02 

G1U103 

Leul26 

Glyl27 

Glyl28 

Prol29 

Tyr214 

Gly215 

Glyl66 

Tyrl67 

Prol68 



Alal52 

Alal53 

Glyl54 

Asnl55 

Glyl56 

Glyl57 

Glyl60 

Thrl58 

Serl59 

Serl61 

Serl62 

Serl63 

Thrl64 

Vall65 

Glyl69 

Lysl70 

Tyrl71 

Prol72 
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The following disclosure is intended to serve as a 
representation of embodiments herein, and should not 
be construed as limiting the scope of this 
application. These specific examples disclose the 
construction of certain of the above identified 
mutants. The construction of the other mutants, 
however, is apparent from the disclosure herein and 
that presented in EPO Publication No. 0130756. 

All literature citations are expressly incorporated by 
reference. 

EXAMPLE 1 



Identification of Peracid Oxidizable 
15 Residues of Subtilisin Q222 and L222 

As shown in Figures 6A and 6B, organic peracid 
oxidants inactivate the mutant subtilisins Met222L and 
Met222Q (L222 and Q222). This example describes the 
identification of .peracid oxidizable sites in these 
20 mutant subtilisins. 

First, the type of amino acid involved in peracid 
oxidation was determined. Except under drastic 
conditions (Means, G.E., et al. (1971) Chemical 

25 Modifications of Proteins , Holden-Day, S.F., CA, 
pp. 160-162) , organic peracids modify only methionine 
and tryptophan in subtilisin. Difference spectra of 
the enzyme over the 250nm to 350nm range were 
determined during an inactivation titration employing 

30 the reagent, diperdodecanoic acid (DPDA) as oxidant. 
Despite quantitative inactivation of the enzyme, no 
change in absorbance over this wavelength range was 
noted as shown in Figures 7A and 7B indicating that 
tryptophan was not oxidized. Fontana, A., et al. 

35 (198-0) Methods in Peptide and Protein Sequence 
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Analysis (C. Birr ed.) Elsevier. New York, p- 309. 
The absence of tryptophan modification implied 
oxidation of one or more of the remaining methionines 
of B. amvloliquefaciens subtilisin. See Figure 1. 

To confirm this result the recombinant subtilisin 
Met222F was cleaved with cyanogen bromide (CNBr) both 
before and after oxidation by DPDA. The peptides 
produced by CNBr cleavage were analyzed on high 
resolution SDS-pyridine peptide gels (SPG) . 
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Subtilisin Met222F (F222) was oxidized in the 
following manner. Purified F222 was resuspended in 
0.1 M sodium borate pH 9.5 at 10 mg/ml and was added 
to a final concentration of 26 diperdodecanoic acid 

15 (DPDA) at 26 mg/ml was added to produce an effective 
active oxygen concentration of 30 ppm. The sample was 
incubated for at least 30 minutes at room temperature 
and then quenched with 0.1 volume of 1 M Tris pH 8.6 
buffer to produce a final concentration of 0.1 M Tris 

20 pH 8.6). 3mM phenylmethylsulfonyl fluoride (PMSF) was 
added and 2.5 ml of the sample was applied to a 
Pharmacia PD10 column equilibrated in 10 mM sodium 
phosphate pH 6.2, 1 mM PMSF. 3.5 ml of 10 mM sodium 
phosphate pH6.2, lmM PMSF was applied and the eluant 

25 collected. 

F222 and DPDA oxidized F222 were precipitated with 9 
volumes of acetone at -20 e C. The samples were 
resuspended at 10 mg/ml in 8M urea in 88% formic acid 

30 and allowed to sit for 5 minutes. An equal volume of 
200 mg/ml CNBr i« 88% formic acid was added (5 mg/ml 
protein) and the samples incubated for 2 hours at room 
temperature in the dark. Prior to gel 
electrophoresis, the samples were lyophilized and 

35 resuspended at 2-5 mg/ml in sample buffer (1% 
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pyridine, 5% NaDodS0 4 , 5% glycerol and bromophenol 
blue) and disassociated at 95°C for 3 minutes. 

The samples were electrophoresed on discontinuous 
polyacrylamide gels (Kyte, J., et al. (1983) 
5 Anal. Bioch. 133 , 515-522) . The gels were stained 
using the Pharmacia silver staining technique 
(Sammons, D.T\ , et al. (1981) Electrophoresis 2 
135-141) . 

10 The results of this experiment are shown in Figure 8. 
As can be seen, F222 treated with CNBr only gives nine 
resolved bands on SPG. However, when F222 is also 
treated with DPDA prior to cleavage, bands X, 7 and 9 
disappear whereas- bands 5 and 6 are greatly increased 

25 in intensity. 

In order to determine which of the methionines were 
effected, each of the CNBr peptides was isolated by 

2o reversed phase HPLC and further characterized. The 
buffer system in both Solvent A (aqueous) and Solvent 
B (organic) for- all HPLC separations was 0.05% 
triethylamime/trifloroacetic acid (TEA-TFA) . In all 
cases unless noted, solvent A consisted of 0.05% 

25 TEA-TFA in H 2 0, solvent B was 0.05% TEA-TFA in 
1-propanol, and the flow rate was 0.5 ml/minute. 

. For HPLC analysis, two injections of 1 mg enzyme 
digest were used. Three samples were acetone 

30 precipitated, washed and dried. The dried 1 rog 
samples were resuspended at 10 mg/ml in 8M urea, 88% 
formic acid; an equal volume of 200 mg/ml CNBr in 8 8% 
formic acid was added (5 mg/ml protein) . After 
incubation for 2 hours in the dark at room 

3 _ temperature, the samples were desalted on a 0 . 8 cm X 7 
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cm column of Tris Acryl GF05 coarse resin (IBF, Paris, 
France) equilibrated with 40% solvent B, 60% solvent 
A. 200 ul samples were applied at a flow rate of 1 ml 
a minute and 1.0-1.2 ml collected by monitoring the 
absorbance at 280nm. Prior to injection on the HPLC, 

5 each desalted sample was diluted with 3 volumes of 
solvent A. The samples were injected at 1.0 ml/min (2 
minutes) and the flow then adjusted to 0.5 ml/min 
(100% A). After 2 minutes, a linear gradient to 60% B 
at 1.0% B/roin was initiated. From each 1 mg run, the 

10 pooled peaks were sampled (50ul) and analyzed by gel 
electrophoresis as described above. 

Each polypeptide isolated by reversed phase HPLC was 
further analyzed for homogeneity by SPG. The position 
15 of each peptide on the known gene sequence (Wells, 
J. A., et al. (1983) Nucleic Acidfe Res. 11 7911-7924) 
was obtained through a combination of amino acid 
compositional analysis and, where needed, amino 
terminal sequencing. 

20 

Prior to such analysis the following peptides were to 
rechromatographed. 

1. CNBr peptides from F222 not treated with DPDA: 

25 

Peptide 5 was subjected to two additional reversed 
phase separations. The 10 cm C4 column was 
equilibrated to 80%A/ 20%B and the pooled sample 
applied and washed for 2 minutes. Next an 0.5% ml 
30 B/rain gradient was initiated. Fractions from this 
separation were again rerun, this time on the 25 cm C4 
column, and employing 0.05% TEA-TFA in 
acetonitrile/l-propanol (1:1) for solvent B. The 
gradient was identical to the one just described. 

35 
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Peptide "X" was subjected to one additional separation 
after the initial chromatography. The sample was 
applied and washed for 2 minutes at 0.5ml/min (100%A) , 
and a 0.5t ml B/min gradient was initiated. 



_ Peptides 7 and 9 were rechromatographed in a similar 
5 

manner to the first rerun of peptide 5. 

Peptide 8 was purified to homogeneity after the 
initial separation . 

10 

2. CKBr Peptides from DPDA Oxidized F222: 



Peptides 5 and 6 from a CNBr digest of the oxidized 
F222 were purified in the same manner as peptide 5 
from the untreated enzyme. 

Amino acid compositional analysis was obtained as 
follows. Samples (-InK each amino acid) were dried, 
hydrolyzed in vacuo with 100 ul 6N HC1 at 106°C for 24 
hours and then dried in a Speed Vac. The samples were 
analyzed on a Beckmann 6300 AA analyzer employing 
ninhydrin detection. 

Amino terminal sequence data was obtained as 
previously described (Rodriguez, H. , et al. (1984) 
Anal. Biochem. 134 , 538-547) . 

The results are shown in Table VII and Figure 9. 
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TABLE VII 

Amino and COOH terminii of CNBr fragments 
Terminus Method 
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Fragment 

X 
9 
7 
8 

Sox 
6 ox 



amino, method 

1, sequence 
51, sequence 
125, sequence 

200, sequence 

1, sequence 

120, composition 



COOH. method 

50, composition 

119, composition 

199, composition 

275, composition 

119, composition 

199, composition 



Peptides 5ox and 6ox refer to peptides 5 and 6 
isolated from CNBr digests of the oxidized protein 
where their respective levels are enhanced. 

From the data in Table VII and the comparison of SPG 
tracks for the oxidized and native protein digests in 
Figure 8, it is apparent that (1) MetSO is oxidized 
leading to the loss of peptides X and 9 and the 
appearance of' 5; and (2) Metl24 is also oxidized 
leading to the loss of peptide 7 and the accumulation 
of peptide 6. Thus oxidation of g. amyloliguif aciens 
subtilisin with the peracid, diperdocecanoic acid 
leads to the specific oxidation of methionine at 
residues 50 and 124. 



EXAMPLE 2 

30 

Substitution at MetSO and Met 124 
?n Subtilisin Met222Q 

The choice of amino acid for substitution at Met50 was 
based on the available sequence data for subtilisins 

35 
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from B. licheniformJg (Smith, E.C., et al. (1968) 
J, Biol. Chem. 243, 2184-2191), B.DY (Nedkov, P., et 
al. (1983) Hoppe Savler's. Z- Ph ysiol, chem. 364 
1537-1540), B. amvlosaccharitigus (Markland, F.S., et 
al. (1967) J. Biol. Chem. 242 5198-5211) and B. 
subtilis (Stahl, M.L., et al. (1984) J. Bacterid . 
158, 411-418). in all cases, position 50 is a 
phenylalanine. See Figure 5. Therefore, Phe50 was 
chosen for construction. 

At position 124, all known subtilisins possess a 
methionine. See Figure 5. Molecular modelling of the 
x-ray derived protein structure was therefore reguired 
to determine the most probable candidates for 
substitution. From all 19 candidates, isoleucine and 
leucine were chosen as the best residues to employ. 
In order to test whether or not modification at one 
site but not both was sufficient to increase oxidative 
stability, all possible combinations were built on the 
Q222 backbone (F50/Q222, I124/Q222, F50/I124/Q222) . 



A. Construction of Mutations 
Between Codons 4 5 and 50 

All manipulations for cassette mutagenesis were 
carried out on pS4.5 using methods disclosed in EPO 
Publication No. 0130756 and Wells, J. A., et al, (1985) 
Gene 34 , 315-323. The p*50 in Fig. 10, line 4, 
mutations was produced using the mutagenesis primer 
shown in Fig. 10, line 6* and employed an approach 
designated as restriction-purification which is 
described below. Briefly, a M13 template containing 
the subtilisin gene, M13mpll-SUBT was used for 
heteroduplex synthesis (Adelman, et al (1983), DNA 2, 
183-193). Following transfection of JM101 (ATCC 
33876), the 1.5 kb £coPI-BamHI fragment containing the 



-66- 



0251 446 



subtilisin gene vas Bubcloned from M13mpll SUBT rf 
into a recipient vector fragment of pBS42 the 
construction of which is described in EPO Publication 
Ho. 0130756. To enrich for the mutant sequence (pA50, 
line 4), the resulting plasmid pool was digested with 

5 Kpn l, and linear molecules were purified by 
polyacrylamide gel electrophoresis. Linear molecules 
were ligated back to a circular form, and transformed 
into £. coli MM294 cells (ATCC 31446). Isolated 
plasmids were screened by restriction analysis for the 

10 KEBl site. Kpn l* plasmids were sequenced and 
confirmed the p^50 sequence. Asterisks in Figure 11 
indicate the bases that are mutated from the wid type 
sequence (line 4). pa50 (line 4) was cut with StuI 
and pcoRI and the 0.5 Kb fragment containing the 5 1 

15 half of the subtilisin gene was purified (fragment 1). 
pA50 (line 4) was digested with Kpn l and EcoR I and the 
4.0 Kb fragment containing the 3 1 half of the 
subtilisin gene and vector sequences was purified 
(fragment 2). Fragments 1 and 2 (line 5), and duplex 

20 DNA cassettes coding for mutations desired (shaded 
sequence, line 6) were mixed in a molar ratio of 
1:1:10, respectively. For the particular construction 
of this example the DNA cassette contained the triplet 
TTT for codon 50 which encodes Phe. This plasmid was 

25 designated pF50. The mutant subtilisin was designated 
F50. 



B. Construction of Mutation 
30 Between Codons 122 and 127 

The procedure of Example 2A was followed in 
substantial detail except that the mutagenesis primer 
of Figure 11, line 7 was used and restriction- 
purification for the EcoRV site in pal24 was used. In 
35 addition, the DNA cassette (shaded sequence, Figure 
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11, line 6) contained the triplet ATT for codon 124 
which encodes He and CTT for Leu. Those plasmids 
which contained the substitution of He for Metl24vere 
designeated pll24. The mutant subtilisin was 
designated 1124. 



C. Construction of Various 
F50/I124/O222 Multiple Mutants 

The triple mutant, F50/I124/Q222, was constructed from 
10 a three-way ligation in which each fragment contained 
one of the three mutations. The single mutant Q222 
(PQ222) was prepared by cassette mutagenesis as 
described in EPO Publication No. 0130756. The F50 
mutation was contained on a 2.2kb Ava il to PyuII 
15 fragment from pF50; the 1124 mutation was contained on 
a 260 bp PvuII to Ava il fragment from pI124; and the 
Q222 mutation was contained on 2.7 Jcb Ava il to Ava il 
fragment from pQ222. The three fragments were ligated 
together and transformed into E. coli MM294 cells. 
20 Restriction analysis of plasmids from isolated 
transformants confirmed the construction. To analyze 
the final construction it was convenient that the 
Avail site at position 798 in the wild-type subtilisin 
gene was eliminated by the 1124 construction. 

25 

The F50/Q222 and I124/Q222 mutants were constructed in 
a similar manner except that the appropriate fragment 
from pS4.5 was used for the final construction* 

30 

D. Oxidative Stability of Q222 Mutants 

The above mutants were analyzed for stability to 
per acid oxidation. As shown in Fig. 12, upon 
incubation with diperdodecanoic acid (protein 2mg/mL, 
35 oxidant 75ppm[0]), both the I124/Q222 and the 
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F50/I124/Q222 are completely stable whereas the 
F50/Q222 and the Q222 are inactivated. This indicates 
that conversion of Metl24 to 1124 in subtilisin Q222 
is sufficient to confer resistance to organic peracid 
oxidants - 

5 

EXAMPLE 3 

Subtilisin Mutants Having Altered 
10 Substrate Specif icity-Hydrophobic 
Substitutions at Residues 166 

Subtilisin contains an extended binding cleft which is 
hydrophobic in character. A conserved glycine at 
residue 166 was replaced with twelve non-ionic amino 
15 acids which can project their side-chains into the S-l 
subsite. These mutants were constructed to determine 
the effect of changes in size and hydrophobicity on 
the binding of various substrates. 

20 

A. Kinetics for Hydrolysis of Substrates 
Having Altered P-l Amino Acids by 
Subtilisin from B. Amyloliquefaciens 

Wild-type subtilisin was purified from ]3. subtilis 
> 5 culture supematants expressing the B. amylolique - 
faciens subtilisin gene (Wells, J. A., et al. (1983) 
Nucleic Acids Res. 11 . 7911-7925) as previously 
described {Estell, D.A. , et al. (1985) J. Biol . Chem. 
260 , 6518-6521). Details of the synthesis of 
0 tetrapeptide substrates having the form 
succinyl-L-AlaL-AlaL-ProL-[X]-p-nitroanilide (where X 
is the PI amino acid) are described by DelMar, E.G., 
et al . (1979) *inal. Biochem. 9,9, 316-320. Kinetic 
parameters, Km(M) and Jccatfs* 1 ) were measured using a 
5 modified progress curve analysis (Estell, D.A. f et al. 
(1985) J. Biol. Chem. 260 , 6518-6521). Briefly, plots 
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of rate versus product concentration were fit to the 
differential form of the rate equation using a 
non-linear regression algorithm. Errors in kcat and 
Km for all values reported are less than five percent. 
The various substrates in Table VIII are ranged in 
order of decreasing hydrophobicity . Nozaki, Y. 
(1971), J ♦ Biol. Chem. 246 , 2211-2217; Tanford C. 
(1978) Science 200, 1012). 



10 



TABLE VIII 



15 



20 



25 



30 



PI substrate 
Amino Acid 


kcat(S _1 ) 


l/Km(M _1 ) 


kcat/Km 
(s- 1 M-l) 


Phe 


50 


7,100 


360,000 


Tyr 


28 


40,000 


1,100,000 


Leu 


24 


3,100 


75,000 


Met 


13 


9,400 


120,000 


His 


7.9 


1,600 


13,000 


Ala 


1.9 


5,500 


11,000 


Gly 


0.003 


8,300 


21 


Gin 


3.2 


2,200 


7,100 


Ser 


2.8 


1,500 


4,200 


Glu 


0.54 


32 


16 



The ratio of kcat/Km (also referred to as catalytic 
efficienty) is the apparent second order rate constant 
for the conversion of free enzyme plus substrate (E+S) 
to enzyme plus products (E+P) (Jencks, W.P., Catalysis 
in Chemistry and Enzymology (McGraw-Hill, 1969) pp. 
321-436; Fersht, A., Enzyme Structure and Mechanism 
(Freeman, San Francisco, 1977) pp. 226-287) . The log 
(kcat/Km) is proportional to transition state binding 



35 
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energy, AG^. A plot of the log kcat/Km versus the 
hydrophobicity of the PI side-chain (Figure 14) shows 
a strong correlation (r =0.98), with the exception of 
the glycine substrate which shows evidence for 
non-productive binding. These data show that relative 
differences between transition-state binding energies 
can be accounted for by differences in P-l side-chain 
hydrophobicity. When the transition-state binding 
energies are calculated for these substrates and 
plotted versus their respective side-chain 
hydrophobicities, the line slope is 1.2 (not shown). 
A slope greater than unity, as is also the case for 
chymotrypsin (Fersht, A., Enzyme Structure and 
Mechanism (Freeman, San Francisco, 1977) pp. 226-287; 
Harper, J.W., et al. (1984) Biochemistry , 23, 
2995-3002) , suggests that the PI binding cleft is more 
hydrophobic than ethanol or dioxane solvents that were 
used to empirically determine the hydrophobicity of 
amino acids (Nozaki, Y., et al. J. Biol. Chem. (1971) 
246 . 2211-2217; Tanford, C. (1978) Science 200 # 1012). 

For amide hydrolysis by subtilisin, Xcat can be 
interpreted as the- acylation rate constant and Km as 
the dissociation constant, for the Michaelis complex 
(E*S), Ks. Gutfreund, H., et al (1956) Biochem. J. 63, 
656. The fact that the log kcat, as well as log 1/Km, 
correlates with substrate hydrophobicity is consistent 
with proposals (Robertus, J.D., et al. (1972) 
Biochemistry 11, 2439-2449? Robertus, J.D., et al. 
(1972) Biochemistry H, 4293-4303) that during the 
acylation step the P-l side-chain moves deeper into 
the hydrophobic cleft as the substrate advances from 
the Michaelis complex (E-S) to the tetrahedral 
transition-state complex (E-S**). However, these data 
can also be interpreted as the hydrophobicity of the 
PI side-chain effecting the orientation, and thus the 
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Busceptibility of the scissile peptide bond to 
nucleophilic attack by the hydroxyl group of the 
catalytic Ser221. 

The dependence of Jccat/Km on P-l side chain 
hydrophobicity suggested that the - Jccat/Km for 
hydrophobic substrates may be increased by increasing 
the hydrophobicity of the S-l binding subsite. To 
test this hypothesis, hydrophobic amino acid 
substitutions of Glyl66 were produced. 

Since hydrophobicity of aliphatic side-chains is 
directly proportional to side-chain surface area 
(Rose, G.D., et al. (1985) Science 229 , 834-838; 
Reynolds, J. A.,' et al. (1974) Proc. Natl. Acad. Sci. 
2SA 71, 2825-2927), increasing the hydrophobicity in 
the S-l subsite may also sterically hinder binding of 
larger substrates. Because of difficulties in 
predicting the relative importance of these two 
opposing effects, we elected to generate twelve 
non-charged mutations at position 166 to determine the 
resulting specificities against non-charged substrates 
of varied size and hydrophobicity. 



B. Cassette Mutagenesis of 
the PI Binding Cleft 

The preparation of mutant subtilisims containing the 
substitution of the hydrophobic amino acids Ala, Val 
and Phe into residue 166 has been described in EPO 
Publication No. 0130756. The same method was used to 
produce the remaining hydrophobic mutants at residue 
166. in applying this method, two unique and silent 
restriction sites were introduced in the subtilisin 
genes to closely flank the target codon 166. As can 
be seen in Figure 13, the wild type sequence (line 1) 
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was altered by site-directed mutagenesis in M13 using 
the indicated 37mer mutagenesis primer, to introduce a 
13 bp delection (dashedline) and unique SacI and JCmal 
sites (underlined sequences) that closely flank codon 
166. The subtilisin gene fragment was subcloned back 

5 into the E. coli - B. subtil is shuttle plasmid, pBS42, 
giving the plasmid pAl66 (Figure 13, line 2). pAl66 
was cut open with Sac I and Xmal, and gapped linear 
molecules were purified (Figure 13, line 3). Pools of 
synthetic oligonucleotides containing the mutation of 

10 interest were annealed to give duplex DNA cassettes 
that were ligated into gapped pAl66 (underlined and 
overlined sequences in Figure 13, line 4). This 
construction restored the coding sequence except over 
position 166(NNN; line 4). Mutant sequences were 
* 15 confirmed by dideoxy sequencing. Asterisks denote 
sequence changes from the wild type sequence. 
Plasmids containing each mutant B. amylolicru efaciens 
subtilisin gene were expressed at roughly equivalent 
levels in a protease deficient strain of B. subtil is, 

20 BG2036 as previously described. EPO Publication No. 
0130756; Yang, M. , et al. (1984) J. Bacteriol. 160, 
15-21; Estell, D.A.\ et al (1985) J. Biol. Chem. 260, 
6518-6521. 



25 

C. Narrowing Substrate Specificity 
by Steric Hindrance 

. To probe the change in substrate specificity caused by 
steric alterations in the S-l subsite, position 166 

30 mutants were kinetically analyzed versus PI substrates 
of increasing size (i.e., Ala, Met, Phe and Tyr) . 
Ratios of kcat/Km are presented in log form in 
Figure 15 to allow direct comparisons of transition- 
state binding energies between various enzyme- 

35 substrate pairs. 
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According to transition state theory, the free enery 
difference between the free enzyme plus substrate 
(E + S) and the transition state complex (E*S^) can be 
calculated from equation (1) , 

(1) a gJ « -rt In kcat/Km + RT In kT/h 

in which kcat is the turnover number, Km is the 
Michaelis constant, R is the gas constant, T is the 
temperature, k is Boltzmann's constant, and h is 
Planck's constant. Specificity differences are 
ezpressed quantitatively as differences between 
transition state binding energies (i.e., aaG^) , and 
can be calculated from equation (2) . 

(2) AA gJ = -RT In (kcat/Km) A / (kcat/Km) B 

A and B represent either two different substrates 
assayed againt the same enzyme, or two mutant enzymes 
assayed against the same substrate. 

As can be seen from Figure 15A, as the size of the 
side-chain at position 166 increases the substrate 
preference 6hifts from large to small P-l side-chains. 
Enlarging the side-chain at position 166 causes 
kcat/Km to decrease in proportion to the size of the 
P-l substrate side-chain (e.g., from Glyl66 
(wild-type) through W166, the kcat/Km for the Tyr 
substrate is decreased most followed in order by the 
Phe, Met and Ala P-l substrates) . 

Specific steric changes in the position 166 
side-chain, such as he presence of a 0-hydroxyl group, 
0- or 7-aliphatic branching, cause large decreases in 
kcat/Km for larger PI substrates. Introducing a 
0 -hydroxy 1 group in going from A166 (Figure 15A) to 
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S166 (Figure 15B) , causes an 8 fold and 4 fold 
reduction in kcat/Km for Phe and Tyr substrates, 
respectively, while the values for Ala and Met 
substrates are unchanged. Producing a ^-branched 
structure, in going from S166 to T166, results in a 

5 drop of 14 and 4 fold in kcat/Km for Phe and Tyr, 
respectively. These differences are slightly 
magnified for V166 which is slightly larger and 
isosteric with T166. Enlarging the ^-branched 
substituents from V166 to 1166 causes a lowering of 

10 kcat/Km between two and six fold toward Met, Phe and 
Tyr substrates. Inserting a 7-branched structure, by 
replacing M166 (Figure ISA) with L166 (Figure 15B) , 
produces a 5 fold and 18 fold decrease in kcat/Km for 
Phe and Tyr substrates, respectively. Aliphatic 

15 7-branched appears to induce less steric hindrance 
toward the Phe P-l substrate than £ -branching, as 
evidenced by the 100 fold decrease in kcat/Km for the 
Phe substrate in going from L166 to 1166. 

20 Reductions in kcat/Km resulting from increases in side 
chain size in the S-l subsite, or specific structural 
features such as 0- and 7 -branching, are quantita- 
tively illustrated in Figure 16. The kcat/Km values 
for the position 166 mutants determined for the Ala, 

25 Met, Phe, and Tyr P-l substrates (top panel through 
bottom panel f respectively) , are plotted versus the 
position 166 side-chain volumes (Chothia, C. (1984) 
Ann. Rev. Biochem. 53, 537-572) . Catalytic efficiency 
for the Ala substrate reaches a maximum for 1166, and 

30 for the Met substrate it reaches a. maximum between 
V166 and L166. The Phe substrate shows a broad 
kcat/Km peak but is optimal with A166. Here, the 
^-branched position 166 substitutions form a line that 
is parallel to, but roughly 50 fold lower in kcat/Km 

35 than side-chains of similar size [i.e., C166 versus 



T166, L166 versus 1166]. The Tyr substrate is most 
efficiently utilized by wild type enzyme (Glyl66) , and 
there is a steady decrease as one proceeds to large 
position 166 side-chains. The £ -branched and 
7 -branched substitutions form a parallel line below 
the other non-charged substitutions of similar 
molecular volume. 

The optimal substitution at position 166 decreases in 
volume with increasing volume of the PI substrate 
[i.e., 1166/Ala substrate, L166/Met substrate, 
A166/Phe substrate, Glyl66/Tyr substrate]. The 
combined volumes for these optimal pairs may 
approximate the volume for productive binding in the 
S-l subsite. - For the optimal pairs, Glyl66/Tyr 
substrate, A166/Phe substrate, L166/Met substrate, 
V166/Met substrate, and 1166/Ala substrate, the 
combined volumes are 266,295,313,339 and 261 A 3 , 
respectively. Subtracting the volume of the peptide 
backbone from each pair (i.e., two times the volume of 
glycine) , an average side-chain volume of 160+32A 3 for 
productive binding can be calculated. 

The effect of volume, in excess to .the productive 
binding volume, on the drop in transition-state 
binding energy can be estimated from the Tyr substrate 
curve (bottom panel, Figure 16), because these data, 
and modeling studies (Figure 2) , suggest that any 
substitution beyond glycine causes steric repulsion. 
A best-fit line drawn to all the data (r = 0.87} gives 
a slope indicating a loss of roughly 3 kcal/mol in 
transition state binding energy per 100A 3 of excess 
volume. (100A is approximately the size of a leucyl 
side-chain.) 
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D Enhanced Catalytic Efficiency 

Correlates with Increasing Hydrophobicity 
of the Position 1 ** Substitution 

substantial increases in kcat/Km occur with 
enlargement of the position 166 side-chain, except for 
the Tyr P-l substrate (Figure 16). For example, 
Jccat/Km increases in progressing from Glyl66 to 1166 
for the Ala substrate (net of ten-fold), from Glyl66 
to L166 for the Met substrate (net of ten-fold) and 
from Glyl66 to A166 for the Phe substrate (net of 
two-fold) . The increases in kcat/Km cannot be 
entirely explained by the attractive terms in the van 
der Waals potential energy function because of their 
strong distance dependence (1/r 6 ) and because of the 
weak nature of these attractive forces (Jencks, W.P., 
15 catalysis in Chemistry and Enzvraoloav (McGraw-Hill, 
1969) pp. 321-436; Fersht, A., Bnzvme Structure and 
Mechanism (Freeman, San Francisco, 1977) pp. 226-287; 
Levitt, M. (1976) J- Mol. Biol. 10±r 59-107). For 
example, Levitt (Levitt, M. (1976) J. Mol. Bio}. 104, 
59-107) has calculated that the van der Waals 
attraction between two methionyl residues would 
produce a maximal interaction energy of roughly -0.2 
kcal/mol. This energy would translate to only 1.4 
fold increase in kcat/Km. 

25 0 ^ 

The increases of catalytic efficiency caused by 

side-chain substitutions at position 166 are better 

accounted for by increases in the hydrophobicity of 

the S-l subsite. The increase kcat/Km observed for 

30 the Ala and Met substrates with increasing position 
166 side-chain size would be expected, because 
hydrophobicity is roughly proportional to side-chain 
surface area (Rose, G.D., et al. (1985) Science 221, 
834-838; Reynolds, J.A., et al. (1974) Proc. Natl. 

35 ftcad. Sci. USA 71. 2825-2927). 



20 
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Another example that can be interpreted as a 
hydrophobic effect is Been when comparing kcat/Km for 
isosteric substitutions that differ in hydrophobicity 
such as S166 and C166 (Figure 16) . Cysteine is 
considerably more hydrophobic than serine (-1.0 versus 
+0.3 kcal/mol) (Nozaki, Y., et al. (1971) J. Biol. 
Chem. 246, 2211-2217; Tanford, C. (1978) Science 200 . 
1012) . The difference in hydrophobicity. correlates 
with the observation that C166 becomes more efficient 
relative to Serl66 as the hydrophobicity of the 
substrates increases (i.e., Ala < Met < Tye < Phe) . 
Steric hindrance cannot explain these differences 
because serine is considerably smaller than cysteine 
(99 versus 118A 3 ). Paul, I.C., Chemistry of the -5H 
Group (ed. S. ~Patai, Wiley Interscience, New York, 
1974) pp. 111-149. 



E. Production of an Elastase-Like 
Specificity in Subtilisin 

The 1166 mutation illustrates particularly well that 
large changes in specificity can be produced by 
altering the - structure and hydrophobicity of the S-l 
subsite by a single mutation (Figure 17) . Progressing 
through the small hydrophobic substrates, a maximal 
specificity improvement over wild type occurs for the 
Val substrate (16 fold in kcat/Km) . As the substrate 
side chain size increases, these enhancements shrink 
to near unity (i.e., Leu and His substrates). The 
1166 enzyme becomes poorer against larger aromatic 
substrates of increasing size (e.g., 1166 is over 
1,000 fold worse against the Tyr substrate than is 
Glyl66) . We interpret the increase in catalytic 
efficiency toward the small hydrophobic substrates for 
1166 compared to Glyl66 to the greater hydrophobicity 
of isoluecine (i.e., -1.8 kcal/mol versus 0). Nozaki, 
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y. # |t al. (1971) J- Biol. Chenu 246, 2211-2217; 
Tanford, C. (1978) Science 200, 1012. The decrease in 
catalytic efficiency toward the very large substrates 
for 1166 versus Glyl66 is attributed to steric 
repulsion. 

5 

The specificity differences between Glyl66 and 1166 
are similar to the specificity differences between 
chymotrypsin and the evolutionary relative, elastase 
(Harper, J.W. , et al (1984) Biochemistry 23, 

10 2995-3002). In elastase, the bulky amino acids, Thr 
and Val, block access to the P-l binding site for 
large hydrophobic substrates that are preferred by 
chymotrypsin. In addition, the catalytic efficiencies 
toward small hydrophobic substrates are greater for 

15 elastase than for chymotrypsin as we obeseve for 1166 
versus Glyl66 in subtilisin. 

EXAMPLE 4 

20 Substitution of Ionic Amino 
Acids for Glvl66 • 

The construction- of subtilisin mutants containing the 
substitution of the ionic amino acids Asp, Asn, Gin, 
Lys and Ang are disclosed in EP0 Publication No. 

25 0130756. The present example describes the 
construction of the mutant subtilisin containing Glu 
at position 166 (E166) and presents substrate 
specificity data on these mutants, Further, data on 
position 166 and 156 single and double mutants is 

30 presented infra . 

pA 166, described in Example 3, was digested with Sad 
and Xmal. The double strand DNA cassette (underlined 
and overlined) of line 4 in Figure 13 contained the 

35 



triplet GAA for the codon 166 to encode the 
replacement of Glu for Glyl66. This mutant plasmid 
designated pQ166 was propagated in BG2036 as 
described. This mutant 6ubtilisin, together with the 
other mutants containing ionic BUbstituent amino acids 
at residue 166, vere isolated as described and further 
analyzed for variations in substrate specificity. 

Each of these mutants was analyzed with the 
tetrapeptide substrates, succinyl-L-AlaL-AlaProL-X- 
-p-nitroanilide, where X was Phe, Ala and Glu. 

The results of this analysis are shown in Table IX. 

TABLE IX 

P-l Substrate 
(kcat/Km x 1(T 4 ) 



Position 166 


Phe 


Ala 


Glu 


Gly (wild type) 


36 . 0 


1.4 


0.002 


Asp (D) 


0.5 


0.4 


<0.001 


Glu (E) 


3.5 


0.4 


<0.001 


Asn (N) 


18.0 


1.2 


0.004 


Gin (Q) 


57.0 


2.6 


0.002 


Lys (K) 


52.0 


2.8 


1.2 


Arg (R) 


42.0 


5.0 


0.08 



These results indicate that charged amino acid 
substitutions at Glyl66 have improved catalytic 
efficiencies (kcat/Km) for oppositely charged P-l 
substrates (as much as 500 fold) and poorer catalytic 
efficiency for like charged P-l substrates. 
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EXAMPLE 5 

substitution of Glycine at Position 169 
The substitution of Glyl69 in g. amvlblio uef aciens 
subtilisin with Ala and Ser is described in EPO 
Publication No. 0130756. The same method was used to 
make the remaining 17 mutants containing all other 
substituent amino acids for position 169. 

The construction protocol is summarized in Figure 18. 
The overscored and underscored double stranded DNA 
cassettes used contained the following triplet 
encoding the substitution of the indicated amino acid 
at residue 169. 



GCT 


A 


ATG 


M 


TGT 


C 


AAC 


N 


GAT 


D 


CCT 


P 


GAA 


£ 


CAA 


Q 


TTC 


F 


AGA 


R 


GGC 


G 


AGC 


S 


CAC 


H 


ACA 


T 


ATC 


I 


GTT 


V 


AAA 


' K 


TGG 


W 


CTT 


L 


TAC 


y 



Each of the plasmids containing a substituted Glyl69 
was designated pX169, where X represents the 
substituent amino acid. The mutant subtilisins were 
simialrly designated. 

Two of the above mutant subtilisins, A169 and S169, 
were analyzed for substrate specificity against 
synthetic substrates containing Phe, Leu, Ala and Arg 
in the P-l position. The following results are shown 
in Table X. 
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TABLE X 

Effect of Serine and Alanine Mutations 
at Position 169 on P-l Substrate Specificity 

— - 4 i 



Position 169 


Phe- 




Ala 


Arg 


Gly (wild type) 


40 


10 


1 


0.4 


A169 


120 


20 


1 


0.9 


S169 


50 


10 


1 


0.6 



These results indicate that substitutions of Ala and 
Ser at Glyl69 have remarkably similar catalytic 
efficiencies against a range of P-l substrates 
compared to their position 166 counterparts • This is 
probably because position 169 is at the bottom of the 
P-l specificity subsite. 

EXAMPLE 6 

Substitution at Position 104 

Tyrl04 has been substituted with Ala, His, Leu, Met 
and Ser. The method used was a modification of the 
site directed mutagenesis method. According to the 
protocol of Figure 19, a primer (shaded in line 4) 
introduced a unique Hind lll site and a frame shift 
mutation at codon 104. Restriction-purification for 
the unique Hind lll site facilitated the isolation of 
the mutant sequence (line 4). Restriction-selection 
against this Hind lll site using pimers in line 5 was 
used to obtain position 104 mutants. 
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The following triplets were used in the primers of 

Figure 19, line 5 for the 104 codon which substituted 
the following amino acids. 



5 



10 



GCT 


A 


TTC 


F 


ATG 


M 


CCT 


P 


CTT 


L 


ACA 


T 


AGC 


S 


TGG 


W 


CAC 


H 


TAC 


Y 


CAA 


Q 


GTT 


V 


GAA 


E 


AGA 


R 


GGC 


G 


AAC 


N 


ATC 


I 


GAT 


D 


AAA 


K 


TGT 


C 



The substrates in Table XI were used to analyze the 
substrate specificity of these mutants. The results 
obtained fo H104 subtilisin are shown in Table XI. 



20 



Substrate 

25 sAAPFpNA 
sAAPApNA 
sFAPFpNA 
sFAEApNA 

30 



TABLE XI 



teat Km Kcat/ftn 



wr 


H104 


wr 




H104 


wr 


ffl.04 


50.0 


22.0 


1.4x10" 


•4 


7.1xl0" 4 


3.6xl0 5 


3.1xl0 4 


3.2 


2.0 


2.3x10" 


•4 


1.9xl0~ 3 


1.4xl0 4 


lxlO 3 


26.0 


38.0 


1.8xl0~ 4 


4.1xl0 -4 


1.5xl0 5 


9.1xl0 4 


0.32 


2.4 


7.3x10" 


•5 


1.5xl0" 4 


4.4xl0 3 


1.6xl0 4 



From these data it is clear that the substitution of 
His for Tyr at position 104 produces an enzyme which 
is more efficient (higher kcat/Km) when Phe is at the 
p-4 substrate position than when Ala is at the P-4 
35 substrate position. 
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Substitution of Alal52 

Alal52 has been substituted by Gly and Ser to 
determine the effect of such substitutions on 
substrate specificity. 



10 



15 



The wild type DNA sequence was mutated by the 
V152/P153 primer (Figure 20, line 4) using the above 
restriction-purification approach for the new Kpn l 
site. Other mutant primers (shaded sequences Figure 
20; S152, line 5 and G152, line 6) mutated the new 
Kpn l site away and such mutants were isolated using 
the restriction-selection procedure as described above 
for loss of the Kpn l site. 



20 



The results of these substitutions for the above 
synthetic substrates containing the P-l amino acids 
Phe, Leu and Ala are shown in Table XII. 



TABLE XII 



25 



Position 152 



Phe 



P-l Substrate 
(kcat/Kmxl0~ 4 ) 



Leu 



Ala 



30 



Gly (G) 

Ala (wild type) 
Ser (S) 



0.2 0.4 <0.04 

40.0 10.0 1.0 
1.0 0.5 * 0.2 



35 



These results indicate that, in contrast to positions 
166 and 169, replacement of Alal52 with Ser or Gly 
causes a dramatic reduction in catalytic efficiencies 
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across all substrates tested. This suggests Alal52, 
at the top of the S-l subsite, may be the optimal 
amino acid because Ser and Gly are homologous Ala 
substitutes . 

5 

EXAMPLE 8 
Substitution at Position 156 

Mutants containing the substitution of Ser and Gin for 
10 Glul56 have been constructed according to the overall 
method depicted in Figure 21. This method was 
designed to facilitate the construciton of multiple 
mutants at position 156 and 166 as will be described 
hereinafter. However, by regenerating the wild type 
15 Glyl66, single mutations at Glul56 were obtained. 

The plasmid pal66 is already depicted in line 2 of 
Figure 13. The synthetic oligonucleotides at the top 
right of Figure 21 represent the same DNA cassettes 
20 depicted in line 4 of Figure 13. The plasmid pl66 in 
Figure 21 thus represents the mutant plasmids of 
Examples 3 and 4. In this particular example, pl66 
contains the wild type Glyl66. 

25 Construction of position 156 single mutants were 
prepared by ligation of the three fragments (1-3) 
indicated at the bottom of Figure 21. Fragment 3, 
containing the carboxy-terminal portion of the 
subtilisin gene including the* wild type position 166 

30 codon, was isolated as a 610 bp Sac I- BamH I .fragment. 
Fragment 1 contained the vector sequences, as well as 
the amino-terminal sequences of the subtilisin gene 
through codon 151. To produce fragment 1, a unique 
Kpn l site at codon 152 was introduced into the wild 

35 type subtilisin sequence from pS4.5. site-directed 



mutagenesis in M13 employed a primer having the 
sequence 5 » -TA-GTC-GTT-GCG-GTA-CCC-GGT-AAC-GAA-3 ' to 
produce the mutation. Enrichment for the mutant 
sequence was accomplished by restriction with KpnI. 
purification and self ligation. The mutant sequence 
containing the XEDl site was confirmed by direct 
plasmid sequencing to give pV152. pV152 (-1 p g) was 
digested with g^nl and treated with 2 units of DNA 
polymerase I large fragment (Klenow fragment* from 
Boeringer-Mannheim) plus 50 ,/M deoxynucleotide 
triphosphates at 37 'C for 30 min. This created a 
blunt end that terminated with codon 151. The DNA was 
extracted with 1:1 volumes phenol and CHC1 3 and DNA in 
the aqueous phase was precipitated by addition of 0,1 
volumes 5M ammonium acetate and two volumes ethanol. 
After centrifugation and washing the DNA pellet with 
70% ethanol, the DNA was lyophilized. DNA was 
digested with BamHI and the 4.6kb piece (fragment 1) 
was purified by acrylamide gel electrophoresis 
followed by electroelution. Fragment 2 was a duplex 
synthetic DNA cassette which when ligated with 
fragments 1 and 3 properly restored the coding 
sequence except at codon 156. The top strand was 
synthesized to contain a glutamine codon, and the 
complementary bottom strand coded for serine at 156. 
Ligation of heterophosphorylated cassettes leads to a 
large and favorable bias for the phosphorylated over 
the non-phosphorylated oligonucleotide sequence in the 
final segrated plasmid product. Therefore, to obtain 
Q15S the top strand was phosphorylated, and annealed 
to the non-phosphorylated bottom strand prior to 
ligation. Similarly, to obtain S156 the bottom strand 
was phosphorylated and annealed to the 
non-phosphorylated top strand. Mutant sequences were 
isolated after ligation and transformation,, and were 
confirmed by restriction analysis and DNA sequencing 
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as before. To express variant subtilisins, plasmids 
vere transformed into a subtil isin-neutral protease 
deletion mutant of g. subtil is . BG2036, as previously 
described. Cultures vere fermented in shake flasks 
for 24 h at 37 *C in LB media containing 12.5 mg/mL 
chloramphenicol and subtilisin was purified from 
culture supernatants as described. Purity of 
subtilisin was greater than 95% as judged by SDS PAGE. 

These mutant plasmids designated pS156 and pQ156 and 
mutant subtilisins designated S156 and Q156 were 
analyzed with the above synthetic substrates where P-l 
comprised the amino acids Glu # Gin, Met and Lys. The. 
results of this analyses are presented in Example 9. 



15 

EXAMPLE 9 



Multiple Mutants With Altered 
Substrate Specificity - Substitution 
at Positions 156 and 166 

Single substitutions of position 166 are described in 
Examples 3 and 4. Example 8 describes single 
substitutions at position 156 as well as the protocol 
of Figure 21 whereby various double mutants comprising 
the substitution of various amino acids at positions 
156 and 166 can be made. This example describes the 
construction and substrate specificity of subtilisin 
containing substitutions at position 156 and 166 and 
summarizes some of the data for single and double 
mutants at positions 156 and 166 with various 
substrates . 

K166 is a common replacement amino acid in the 156/166 
mutants described herein. The replacement of Lys for 
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Glyl66 was achieved by using the synthetic DNA 
cassette at the top right of Figure 21 which contained 
the triplet AAA for NNN. This produced fragment 2 
with Lys substituting for Glyl66. 

The 156 substituents were Gin and Ser. The Gin and 
Ser substitutions at Glyl56 are contained within 
fragment 3 (bottom right Figure 21) . 

The multiple mutants were produced by combining 
fragments 1, 2 and 3 as described in Example 8. The 
mutants Q156/K166 and S156/K166 were selectively 
generated by differential phosphorylation as 
described. Alternatively, the double 156/166 mutants, 
c.f. Q156/K166 and S156/K166, were prepared by 
ligation of the 4.6kb SacI-BainHI fragment from the 
relevant pl56 plasmid containing the 0.6kb Sac I- BamH I 
fragment from the relevant pl66 plasmid. 

These mutants, the single mutant K166, and the S156 
and Q156 mutants of Example 8 were analyzed for 
substitute specificity against synthetic polypeptides 
containing Phe or Glu as the P-l substrate residue. 
The results are" presented in Table XIII. 
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As can be seen in Table XIV, either of these single 
mutations improve enzyme performance upon substrates 
with glutamate at the P-l enzyme binding site. When 
these single mutations were combined, the resulting 
multiple enzyme mutants are better than either parent. 
These single or multiple mutations also alter the 
5 relative pH activity profiles of the enzymes as shown 
in Figure 23. 

To isolate the contribution of electrostatics to 
substrate specificity from other chemical binding 

10 forces, these various single and double mutants were 
analyzed for their ability to bind and cleave 
synthetic substrates containing Glu, Gin, Met and Lys 
as the P-l substrate amino acid. This permitted 
comparisons between side-chains that were more 

15 sterically similar but differed in charge (e.g., Glu 
versus Gin, Lys versus Met) . Similarly, mutant 
enzymes were assayed against homologous P-l substrates 
that were most sterically similar but differed in 
charge (Table XIV) . 
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Footnotes -bo Table XIV ; 0251446 

^ B. subtilis . BG 2036, expressing indicated 
variant subtilisin were fermented and enzymes purified 
as previously described (Estell, et al. (1985) J. 
Biol. Chem. 260 , 6518-6521). Wild type subtilisin is 
indicated (wt) containing Glul56 and Glyl66. 

^ Net charge in the P-l binding site is defined as 
5 the sum of charges from positions 156 and 166 at pH 
8.6. 

f c} -1 

1 ' Values for Jccat(s ) and Km(M) were measured in 
0.1M Tris pH 8.6 at 25 'C as previously described 
against * P-l substrates having the form 
succinyl-L-AlaL-AlaL-ProL- [X] -p-nitroanilide , where X 
is the indicated P-l amino acid. Values for log 1/Km 
10 are shown inside parentheses. All errors in 
determination of kcat/Km and 1/Km are below 5%. 

^ Because values for Glul56/Aspl66 (D166) are too 
small to. determine accurately, the maximum difference 
taken for GluP-1 substrate is limited to a charge 
range of +1 to -1 charge change. 
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n.d. = not determined 

The kcat/Km ratios shown are the second order rate 
constants for the conversion of substrate to product, 
and represent the catalytic efficiency of the enzyme. 
These ratios are presented in logarithmic form to 
scale the data, and because log kcat/Km is 
proportional to" the lowering of transition-state 
activation energy (AG T ) . Mutations at position 156 
and 166 produce changes in catalytic efficiency toward 
Glu, Gin, Met and Lys P-l substrates of 3100, 60, 200 
and 20 fold, respectively. Making the P-l 
binding-site more positively charged [e.g., compare 
Glnl56/Lysl66 (Q156/K166) versus Glul56/Metl66 
(Glul56/M166) ] dramatically increased kcat/Km toward 
the Glu P-l substrate (up to 3100 fold) , and decreased 
the catalytic efficiency toward the Lys P-l substrate 
(up to 10 fold). In addition, the results show that 
the catalytic efficiency of wild type enzyme can be 
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greatly improved toward any of the four P-l substrates 
by mutagenesis of the P-l binding site. 



The changes in kcat/Km are caused predominantly by 
changes in 1/Km. Because 1/Km is approximately equal 
to 1/Ks, the enzyme-substrate association constant, 

5 the mutations primarily cause a change in substrate 
binding. These mutations produce smaller effects on 
kcat that run parallel to the effects on 1/Km. The 
changes in kcat suggest either an alteration in 
binding in the P-l binding site in going from the 

10 Michael is-complex E»S) to the transition-state complex 
(E-S/) as previously proposed (Robertus, J.D., et al- 
(1972) Biochemistry 11, 2439-2449; Robertus, J.D., et 
al. (1972) Biochemistry 11, 4293-4303), or change in 
the position of the scissile peptide bond over the 

15 catalytic serine in the E*S complex. 

Changes in substrate preference that arise from 
changes in the net charge in the P-l binding site show 
trends that are best accounted for by electrostatic 

2 0 effects (Figure 28). As the P-l binding cleft becomes 
more positively charged, . the average catalytic 
efficiency increases much more for the Glu P-l 
substrate than for its neutral and isosteric P-l 
homolog, Gin (Figure 28A) . Furthermore, at the 

25 positive extreme both substrates have nearly identical 
catalytic efficiencies. 

In contrast, as the P-i site becomes more positively 
charged the catalytic efficiency toward the Lys P-l 
J0 substrate decreases, and diverges sharply from its 
neutral and isosteric homolog, Met (Figure 28B) . The 
similar and parallel upward trend seen with increasing 
positive charge for the Met and Glu P-l substrates 
probably results from the fact that all the substrates 
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are succinylated on their amino-terminal end, and thus 
carry a formal negative charge. 



The trends observed in log kcat/Km are dominated by 
changes in the Km term (Figures 28C and 28D) . As the 
pocket becomes more positively charged, the log 1/Km 

5 values converge for Glu and Gin P-l substrates (Figure 
28C) , and diverge for Lys and Met P-l substrates 
(Figure 28D) . Although less pronounced effects are 
seen in log kcat, the effects of P-l charge on log 
kcat parallel those seen in log 1/Km and become larger 

10 as the p ~ 1 P ocket becomes more positively charged. 
This may result from the fact that the transition- 
state is a tetrahedral anion, and a net positive 
charge in the enzyme may serve to provide some added 
stabilization to the transition-state. " 

15 

The effect of the change in P-l binding-site charge on 
substrate preference can be estimated from the 
differences in slopes between the charged and neutral 
isosteric P-l substrates (Figure 28B) . The average 

20 change in substrate preference (Aloq kcat/Km) between 
charged and neutral isosteric substrates increases 
roughly 10-fold as the complementary charge or the 
enzyme increases (Table XV) . Khen comparing Glu 
versus Lys, this difference is 100-fold and the change 

25 in substrate preference appears predominantly in the 
Km term. 
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TABLE XV 0251446 



10 



Differential Effect on Binding Site 
Charge on log kcat/Km or (log 1/Km) (a) 
for P-l Substrates that Differ in Charge 



Change in P-l Binding Alog kcat /Km (Alog 1/Km) 

Site Charge (b) GluGln MetLys GluLys 

-2 to -1 n.d. 1.2 (1.2) n.d. 

_1 to 0 0.7 (0.6) 1.3 (0.8) 2.1 (1.4) 

0 to +1 1.5 (1.3) 0.5 (0.3) 2.0 (1.5) 

Avg. change in 
log kcat/K or 
(log 1/Km) per 

unit charge change 1.1 (1.0) 1.0 (0.8). 2.1 (1.5) 



15 (a) The difference in the slopes of curves were taken 
between the P-l substrates over the charge interval 
given for log (kcat/Km) (Figure 28A, B) and (log 1/Km) 
(Figure 28C, D) . Values represent the differential 
effect a charge change has in distinguishing the 
substrates that are compared. 

20 (b) Charge in P-l binding site is defined as the sum 
of charges from positions 156 and 166. 
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The free energy of electrostatic interactions in the 
structure and energetics of salt-bridge formation 
depends on the distance between the charges and the 
microscopic dielectric of the media. To dissect these 
structural and microenvironmental effects, the 

5 energies involved in specific salt-bridges were 
evaluated. In addition to the possible salt-bridges 
shown (Figures 29A and 29B) , reasonable salt-bridges 
can be built between a Lys P-l substrate and Asp at 
position 166, and between a Glu P-l substrate and a 

10 Lys at position 166 (not shown) . Although only one of 
these structures is confirmed by X-ray crystal ography 
(Poulos, T.L. , et al. (1976) J, Mol. Biol. 257 
1097-1103), all models have favorable torsion angles 
(Sielecki, A.R., et al. (1979) J. Mol, Biol. 134 . 

15 781-804)., and do not introduce unfavorable van der 
Waals contacts. 

The change in charged P-l substrate preference brought 
about by formation of the model salt-bridges above are 
20 shown in Table XVI. 
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Footnotes to Table XVI : 

Molecular modeling shows it is possible to form a 
salt bridge between the indicated charged P-l 
substrate and a complementary charge in the P-l 
binding site of the enzyme at the indicated position 
changed. 

^ Enzymes compared have sterically similar amino 
acid substitutions that differ in charge at the 
indicated position. 

( c) 

v ' The P-l substrates compared are structurally 
similar but differ in charge. The charged P-l 
substrate is complementary to the charge change at the 
position indicated between enzymes 1 and 2. 

^ Date from Table XIV was used to compute the 
difference in log (kcat/Km) between the charged and 
the non-charged P-l substrate (i.e., the substrate 
preference) . The substrate preference is shown 
separately for enzyme 1 and 2. 

' The difference in substrate preference between 
enzyme 1 (more highly charged) and enzyme 2 (more 
neutral) represents the rate change accompanying the 
electrostatic interaction. 



The difference between catalytic efficiencies (i.e., 
Alog kcat/Km) for the charged and neutral P-l 
substrates (e.g., Lys minus Met or Glu minus Gin) give 
the substrate preference for each enzyme. The change 
in substrate preference* (AAlog kcat/Km) between the 
charged and more neutral enzyme homologs (e.g., 
Glul56/Glyl66 minus Glnl56 (Q156)/Glyl66) reflects the 
change in catalytic efficiency that may be attributed 
solely to electrostatic effects. 



These results show that the average change in 
substrate preference is considerably greater when 
electrostatic substitutions are produced at position 
166 (50-fold in kcat/Km) versus position 156 (12-fold 
in kcat/Km) . From these a A log kcat/Km values, an 
average change in transition-state stabilization 
energy can be calculated of -1.5 and -2.4 kcal/mol for 
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substitutions at positions 156 and 166, respectively. 
This should represent the stabilization energy 
contributed from a favorable electrostatic interaction 
for the binding of free enzyme and substrate to form 
the transition-state complex. 

5 EXAMPLE 10 

Substitutions at Position 217 

Tyr217 has been substituted by all other 19 amino 
10 acids. Cassette mutagenesis as described in EPO 
publication No. 0130756 w.as used according to the 
protocol of Figure 22. The EcoRV restriction site was 
used for restriction-purification of pA217. 

15 Since this position is involved in substrate binding, 
' mutations here effect kinetic parameters of the 
enzyme. An example is the substitution of Leu for Tyr 
at position 217. For the substrate sAAPFpNa , this 
mutant has a kcat of 277 5' and a Km of 4.7xl0~ with 

20 a kcat/Km ratio of 6xl0 5 . This represents a 5.5-fold 
increase in kcat with a 3-fold increase in Km over the 
wild type enzyme'. 

In addition, replacement of Tyr 217 by Lys, Arg, Phe or 
25 Leu results in mutant enzymes which are more stable at 
pHs of about 9-11 than the WT enzyme. Conversely, 
replacement of Tyr217 by Asp, Glu, Gly or Pro results 
in enzymes which are less stable at pKs of about 9-11 
than the WT enzyme. - 

30 
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Multiple Mutants Having 
Altered Thermal stability 

a. ^ylolig^facien subtilisin does not contain any 
5 cysteine residues. Thus, any attest to produce 
thecal stability by Cys cross-linkage required the 
substitution of more than one amino acid in subtilisin 
with cys. The following subtilisin residues were 
multiply substituted with cysteine: 
10 Thr22/Ser87 

Ser24/Ser87 

Mutagenesis of Ser24 to Cys was carried out with a 5- 
Phosphorylated - oligonucleotide priaer having the 
15 sequence 

5 • -pC-TAC-ACT-GGA^TGC-AAT-GTT-AAA-G-3 » . 

(Asterisks show the location of mismatches and the 
20 underlined sequence shows the position of the altered 
Sau3A S1 te.) The B. amvlolWf^o^ subtilisin gene 
on a 1.5 kb EcoRI-baiihi fragment from P S4.5 was cloned 
into M13m P n and single stranded DNA was isolated 
This template (M13mpHSUBT) was double primed with the 
25 5' phosphorylated M13 universal sequencing primer and 
the mutagenesis primer. Adelman, et al. (19 83 ) DNA 2 
183-193. The heteroduplex was transfected into 
competent JM101 cells and plaques were probed for the 
mutant sequence (Zoller, a. J., et al. (19S2) Nueleic 
30 A£id__Rej L 10, 6487-6500; Wallace, et al. (i 981) 
*** ** ■ 3647-3656, using a 

tetramethylammoniua chloride hybridization protocol 
(Wood, e£ al. (1985) Proe . Wat1 , cad> gci mx ^ 

1585-1588) . The Ser87 to Cys mutation, was prepared in 

35 
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, „ e < nn a 5« phosphorylated primer 

a similar fashion using a » P"" e * 

having the sequence 

5 • -pGGC-GTT-GCG-CCA-TGC-GCA-TCA-CT-3 ' . 

(Th e asterisk indicates the position of the ^mismatch 
- and the underlined sequence shows the position of a 
' Tew ^1 site.) The C2< and C87 mutations were 

ohtaitd at a frequency of one and two percent, 

respectively. Mutant sequences were confirmed by 

dideoxy sequencing in M13. 

10 Mutagenesis o£ Tyr21/Thr2 2 to A21/C22 was carried out 
with a 5- phosphorylated oligonucleotide primer havmg 

the sequence 

15 s.-pAC-TCT-CW-GSC-aa-MT-OGS^Ttt-AAT-GIT-S' . 

(The asterisXs show mismatches to the wild type 
sequence and the underUned sequence .how. . the 
position of an altered £fiU.3A It.., "f^"""*" 
20 hetereduple* synthesis ver. identical to those 
described for C24. Because direct clonmg of the 
heteroduple* DMA fragment can yield increased 
fancies of mutagenesis, the MffX-BtfaT subtUisin 
fragment was purified and ligated into pBS42. tajU 
„ m 294 cells were transformed with the ligation 
" Iture and plasmid PH* was purified from isolated 
transformants. Plasmid DKA was screened or th loss 
of the S.U3A site at codon 23 that was eliminate* by 

" ■ . „_,„., two out of 16 plasmid 

the mutagenesis primer. two 

,„ preparations had lost the wild type Sin3» site. ■ *>» 
30 mutant sequence was confirmed by dideo*y sequencing in 

M13 . 
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Double mutants, C22/C87 and C24/C87, were constructed 
by ligating fragments sharing a common Cla l site that 
separated the single parent cystine codons. 
Specifically, the 500 bp EcoRI-Clal fragment 
containing the 5 1 portion of the subtilisin gene 
(including codons 22 and 24) was ligated with the 4.7 
kb Clal-EcoRI fragment that contained the 3' portion 
of the subtilisin gene (including codon 87) plus pBS42 
vector sequence. E. coli MM 294 was transformed with 
ligation mixtures and plasnid DNA was purified from 
individual transformants. Double-cysteine plasmid 
constructions were identified by restriction site 
markers originating from the parent cysteine mutants 
(i.e., C22 and C24, Sau 3A minus; Cys87, Mst I plus). 
Plasmids from E. coli were transformed into g. 
subtilis BG203 6. The thermal stability of these 
mutants as compared to wild type subtilisin are 
presented in Figure 30 and Tables XVII and XVIII. 
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Effect of DTT on the Half-Time of 
Autolytic Inactivation of Wild-Type 
and Disulfide Mutants of Subtilisin* 



Enzyme 



-DDT 



+DTT 



-DTT/+DTT 



mm 



10 



Wild-type 

C22/C87 

C24/C87 



95 
44 
92 



85 
25 
62 



1.1 
1.8 
1.5 



Purified enzymes were either treated or not 
treated with 25mM DTT and dialyzed with or without 
lOmM DTT in 2mM CaCl 2 , 50mM Tris (pH 7.5) for 14 hr. 
at 4°C. Enzyme concentrations were adjusted to 80pl 
aliquots were quenched on ice and assayed for residual 
activity. Half-times for autolytic inactivation were 
determined from semi-log plots of 1o 9iq (residual 
activity) versus time. These plots were linear for 
over 90% of the inactivation. 
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Effect of Mutations in Subtilisin 
. on the Half -Time of Autolytic 
Inactivation at 58 *C* 



Enzyme j 

5 ffiin 

Wild-type 120 

C22 22 

C24 120 

C87 104 

10 C22/C87 43 

C24/C87 115 



Half-times for autolytic inactivation were 
15 determined for wild-type and mutant subtilisins as 
described in the legend to Table III. Unpurified and 
non-reduced enzymes were used directly from B. 
subtilis culture supernatants . 

The disulfides introduced into subtilisin did not 
20 im P rove the autolytic stability of the mutant enzymes 
when compared to the wild-type enzyme. However, the 
disulfide bonds did provide a margin of autolytic 
stability when compared to their corresponding reduced 
double-cysteine enzyme. Inspection of a highly 
15 refined x-ray structure of wild-type B. amvloliaue - 
faciens subtilisin reveals a hydrogen bond between 
Thr22 and Ser87. Because cysteine is a poor hydrogen 
donor or acceptor (Paul, I.e. (1974) in Chemistry of 
the -SH Group (Patai, S., ed.) pp. 111-149, Wiley 
Interscience, New York) weakening of 22/87 hydrogen 
bond may explain why the C22 and C87 single-cysteine 
mutant proteins are less autolytically stable than 
either C24 or wild-type (Table XVIII). The fact that 
C22 is less autolytically stable than C87 may be the 
result of the Tyr21A mutation (Table XVIII) . Indeed, 



0251 446 

-104- 

construction and analysis of Tyr21/C22 shows the 
mutant protein has an autolytic stability closer to 
that of C87. In summary, the C22 and C87 of 
single-cysteine mutations destabilize the protein 
toward autolysis, and disulfide bond formation 
increases the stability to a level less than or equal 
to that of wild-type enzyme. 



EXAMPLE 12 

10 

Multiple Mutants Containing Substitutions 
at Position 222 and Position 166 or 169 

Double mutants 166/222 and 169/222 were prepared by 
ligating together (1) the 2.3kb Acall fragment from 

15 pS4.5 which contains the 5' portion of the subtilisin 
gene and vector sequences, (2) the 200bp Ava il 
fragment which contains the relevant 166 or 169 
mutations from the respective 166 or 169 plasmids, and 
(3) the 2.2kb Ava il fragment which contains the 

2 0 relevant 222 mutation 3 1 and of the subtilisin genes 
and vector sequence from the respective p222 plasmid. 

Although mutations at position 222 improve oxidation 
stability they also tend to increase the Km. An 
25 example is shown in Table XIX. In this case the A222 
mutation was combined with the K166 mutation to give 
an enzyme with Jccat and Km intermediate between the 
two parent enzymes. 
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5 





kcat 


Km 


WT 


50 


1.4xl0~ 4 


A222 


42 


9.9xl0~ 4 


K166 


21 


3.7X10" 5 


K166/A222 


29 


2.0xl0~ 4 



substrate sAAPFpNa 

10 

EXAMPLE 13 

Multiple Mutants Containing 
15 Substitutions at Positions 50, 156, 
166. 217 and Combinations Thereof 

The double mutant S156/A169 was prepared by ligation 
of two fragments, each containing one of the relevant 
mutations. The plasmid pS156 was cut with Xma l and 

20 treated with SI nuclease to create a blunt end at 
codon 167. After removal of the nuclease by 
phenol/chloroform extraction and ethanol precipita- 
tion, the DNA was digested with BamHI and the 
approximately 4kb fragment containing the vector plus 

25 the 5 1 portion of the subtilisin gene through codon 
167 was purified. 

The pA169 plasmid was digested with Kpn l and treated 
with DNA polymerase Klenow fragment plus 50 «M dKTPs 
30 to create a blunt end codon at codon 168. The Klenow 
was removed by phenol/chloroform extraction and 
ethanol precipitation. The DNA was digested with 
BamHI and the 590bp fragment including codon 168 
through the carboxy terminus of the subtilisin gene 

35 
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was isolated. The two fragments were then ligated to 
give S156/A169. 



Triple and quadruple mutants were prepared by ligating 
together (1) the 220bp PvuII/Haell fragment containing 
the relevant 156, 166 and/or 169 mutations from the 
5 respective pl56, pl66 and/or p!69 double of single 
mutant plasmid, (2) the 550bp Haell/BamHI fragment 
containing the relevant 217 mutant from the respective 
p217 plasmid, and (3) the 3.9kb PvuII/BamHI fragment 
containing the F50 mutation and vector sequences. 

10 

The multiple mutant F50/S156/A169/L217 , as well as B. 
amyloliguefaciens subtilisin, B. lichenformis 
subtilisin and the single mutant L217 were analyzed 
with the above synthetic polypeptides where the P-l 
15 amino acid in the substrate was Lys, His, Ala, Gin, 
Tyr, Phe, Met and Leu. These results are shown in 
Figures 26 and 27. 

These results show that the F50/S156/A169/L217 mutant 
20 has substrate specificity similar to that of the B. 
licheniformis enzyme and differs dramatically from the 
wild type enzyme. . 'Although only data for the L217 
mutant are shown, none of the single mutants (e.g., 
F50, S156 or A169) showed this effect. Although B. 
25 licheniformis differs in 88 residue positions from B. 
amyloliguefaciens , the combination of only these four 
mutations accounts for most of the differences in 
substrate specificity between the two enzymes. 

30 EXAMPLE 14 

Subtilisin Mutants Having 
Altered Alkaline Stability 

A random mutagenesis technique was used to generate 
single and multiple mutations within the B. 
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amvloliouef aciens subtilisin gene. Such mutants vere 
screened for altered alkaline stability. Clones 
having increased (positive) alkaline stability and 
decreased (negative) alkaline stability were isolated 
and sequenced to identify the mutations within the 
subtilisin gene. Among the positive clones, the 
5 mutants V107 and R213 were identified. These single 
mutants were subsequently combined to produce the 
mutant V107/R213. 

One of the negative clones (V50) from the random 
10 mutagenesis experiments resulted in a marked decrease 
in alkaline stability. Another mutant (P50) was 
analyzed for alkaline stability to determine the 
effect of a different substitution at position 50. 
The F50 mutant was found to have a greater alkaline 
15 stability than wild type subtilisin and when combined 
with the double mutant V107/R213 resulted' in a mutant 
having an alkaline stability which reflected the 
aggregate of the alkaline stabilities for each of the 
individual mutants. 

20 

The single mutant R204 and double mutant C204/R213 
were identified by alkaline screening after random 
cassette mutagenesis over the region from position 197 
to 228. The C204/R213 mutant was thereafter modified 

25 to produce mutants containing the individual mutations 
C204 and R213 to determine .the contribution of each of 
the individual mutations. Cassette mutagenesis using 
pooled oligonucleotides to substitute all amino acids 
at position 204 9 was utilized to determine which 

30 substitution at position 204 would maximize the 
increase in alkaline stability. The mutation from 
Lys213 to Arg was maintained constant for each of 
these substitutions at position 204. 

35 
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A. Construction of pB0180, an 

E. coli-B. subtilis Shuttle Plasmid 

The 2.9 kb EcoRI-BamHI fragment from pBR327 
(Covarrubias, L. , et al. (1981) Gene 13, 25-35) was 
ligated to the 3.7kb Eco RI- Bam HI fragment of pBD64 
(Gryczan, T., et al. (1980) J. Bacteriol. , 141, 
246-253) to give the recombinant plasmid pB0153. The 
unique Eco RI recognition sequence in pBD64 was 
eliminated by digestion with Eco RI followed by 
treatment with Klenow and deoxynucleotide 
triphosphates (Maniatis, T. , et al. (eds.) (1982) in 
Molecular Cloning, A Laboratory Manual , Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y.). Blunt 
end ligation and transformation yielded pB0154. The 
unique Ava l recognition sequence in pB0154 was 
eliminated in a similar manner to yield pB017*l. 
pB0171 was digested with BamH I and PvuII and treated 
with Klenow and deoxynucleotide triphosphates to 
create blunt ends. The 6.4 kb fragment was purified, 
ligated and transformed into LE392 cells (Enquest, 
L.W., et al. (1977) J. Mol. Biol. Ill , 97-120), to 
yield pB0172 which retains the unique Bam HI site. To 
facilitate subcloning of subtilisin mutants, a unique 
and silent Kpn l site starting at codon 166 was 
introduced into the subtilisin gene from pS4.5 (Wells, 
J. A., et al* (1983) Nucleic Acids Res. , 11, 7911-7925) 
by site-directed mutagenesis. The Kpn l+ plasmid was 
digested with Eco RI and treated with Klenow and 
deoxynucleotide triphosphates to create a blunt end. 
The Klenow was inactivated by heating for 20 min at 
68 °C, and the DNA was digested with BamH I. The 1.5 kb 
blunt Eco RI- Bam HI fragment containing the entire 
subtilisin was ligated with the 5,8 kb Nr u I -Bam HI from 
pB0172 to yield pBO180. The ligation of the blunt 
Nrul end to the blunt EcoRI end recreated an Eco RI 
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site. Proceeding clockwise around pB0180 from the 
EcoRI site at the 5' end of the subtilisin gene is the 
unique Bam HI site at the 3' end of the subtilisin 
gene, the chloramphenicol and neomycin resistance 
genes and UB110 gram positive replication origin 
derived from pBD64, the ampicillin resistance gene and 
gram negative replication origin derived from pBR327. 



B. Construction of Random Mutagenesis Library 

The 1.5 kb Eco RI- Bam HI fragment containing the B. 
amyloliquefaciens subtilisin gene (Wells et al., 1983) 
from pB0180 was cloned into M13mpll to give M13mpll 
SUBT essentially as previously described (Wells, J. A., 
et al. (1986) J. Biol. Chem. , 261 ,6564-6570) . 
Deoxyuridine containing template DNA was prepared 
according to Kunkel (Kunkel f T.A. (1985) Proc. Natl. 
Acad. Sci. USA , 82 488-492). Uridine containing 
template DNA (Kunkel, 1985) was purified by CsCl 
density gradients (Maniatis, T. et al. (eds.) (1982) 
in Molecular Cloning, A Laboratory Manual , Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y.). A 
primer (Aval ) having the sequence 

5 ' GAAAAAAG ACCCTAG CGTCGCTTA 

ending at codon -11, was used to alter the unique Ava l 
recognition sequence within the subtilisin gene. (The 
asterisk denotes the mismatches from the wild-type 
sequence and underlined is the altered Ava l site.) 

The 5 f phosphorylated Ava l primer (-320 pmol) and -40 
pmol (~120pg) of uridine containing M13mpll SUBT 
template in 1.88 ml of 53 mM NaCl, 7.4 mM MgC12 and 
7.4 mM Tris.HCl (pH 7.5) were annealed by heating to 
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90°C for 2 min. and cooling 15 min at 24°C (Fig. 31). 
Primer extension at 24 °C was initiated by addition of 
100pL containing 1 mM in all four deoxynucleotide 
triphosphates, and 20pl Klenow fragment (5 units/1). 
The extension reaction was stopped every 15 seconds 
over ten min by addition of 10yl 0.25 M EDTA (pH 8) to 

5 50pl aliquots of the reaction mixture. Samples were 
pooled, phenol chlorophorm extracted and DNA was 
precipitated twice by addition of 2.5 vol 100% 
ethanol, and washed twice with 70% ethanol. The 
pellet was dried, and redissolved in 0.4 ml 1 mM EDTA, 

10 10 mM Tris (pH 8) . 

Misincorporation of a-thiodeoxynucleotides onto the 3' 
ends of the pool .of randomly terminated template was 
carried out by incubating four 0.2 ml solutions each 

15 containing one-fourth of the randomly terminated 
template mixture <~20pg), 0.25 mM of a given 
a-thiodeoxynucleotide triphosphate, 100 units AMV 
polymerase, 50 mM KCL, 10 mM MgCl 2 , 0.4 mM 
dithiothreitol, and 50 mM Tris (pH 8.3) (Champoux, 

20 J.J. (1984) Genetics , 2, 454-464). After incubation 
at 37 °C for 90 minutes, misincorporation reactions 
were sealed by incubation for five minutes at 37 °C 
with 50 mM all four deoxynucleotide triphosphates (pH 
8) , and 50 units AMV polymerase. Reactions were 

25 stopped by addition of 25 mM EDTA (final) , and heated 
at 6 8°C for ten min to inactivate AMV polymerase. 
After ethanol precipitation and resuspension, 
synthesis of closed circular heteroduplexes was 
carried out for two days at 14 °C under the same 

30 conditions used for the timed extension reactions 
above, except the reactions also contained 1000 units 
T4 DNA ligase, 0.5 mM ATP and 1 mM B-mercaptoethanol. 
Simultaneous restriction of each heteroduplex pool 
with Kpnl f Bam HI , and EcoRI confirmed that the 
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extension reactions were nearly quantitative. 
Heteroduplex DNA in each reaction mixture was 
methylated by incubation with 80yM 

S-adenosylmethionine and 150 units dam methylase for 1 
hour at 37 °C. Methylation reactions were stopped by 
heating at 68 °C for 15 min. 

One-half of each of the four methylated heteroduplex 

reactions were transformed into 2.5 ml competent E. 

coli JM101 (Messing, J. (1979) Recombinant DNA Tech. 

Bull . , 2, 43-4 8) . The number of independent 

transformants from each of the four transformations 

ranged from 0.4-2.0 x 10 5 . After growing out phage 

pools, RF DNA from each of the four transformations 

was isolated and ..purified by centrifugation through 

CsCl density gradients. Approximately 2yg of RF DNA 

from each of the four pools was digested with EcoRI, 

Bam HI and Ava l. The 1.5 kb EcoRI -BamHI fragment 

(i.e., Ava l resistant) was purified on low gel 

temperature agarose and ligated into the 5.5 kb 

EcoRI -BamH I vector fragment of pB0180. The total 

number of independent transformants from each 

a-thiodeoxynucleotide misincorporation plasmid library 

4 

ranged from 1.2-2.4 x 10 . The pool of plasmids from 
each of the four transformations was grown out in 200 
ml LB media containing 12.5pg/ml cmp and plasmid DNA 
was purified by centrifugation through CsCl density 
gradients. 

C. Expression and Screening 

of Subtilisin Point Mutants 

Plasmid DNA from each of the four misincorporation 
pools was transformed (Anagnostopoulos , C. , et al. 
(1967), J. Bacterid. , 81, 741-746) into BG2036. For 
each transformation, 5vg of DNA produced approximately 
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2.5 x 10 5 independent BG2036 transformantB, and liquid 
culture aliguots from the four libraries were stored 
in 10% glycerol at 70 *C. Thawed aliguots of frozen 
cultures were plated on LB/5/*g/ml cmp/1.6% skim milk 
plates (Wells, J. A., et al. (1983) flucleic Acids Res. . 
11 , 7911-7925) , and fresh colonies were arrayed onto 
96-well microtiter plates containing 150 1 per well LB 
media plus 12.5pg/ml cmp. After 1 h at room 
temperature, a replica was stamped (using a matched 96 
prong stamp) onto a 132 mm BA 85 nitrocellulose filter 
(Schleicher and Scheull) which was layered on a 140 mm 
diameter LB/cmp/skim milk plate. Cells were grown 
about 16 h at 30 *C until halos of proteolysis were 
roughly 5-7 mm in diameter and filters were 
transferred directly to a freshly prepared agar plate 
at 37 *C containing only 1.6% skim milk and 50 mM 
sodium phosphate pH 11.5. Filters were incubated on 
plates for 3-6 h at 37 *C to produce halos of about 5 
mm for wild-type subtilisin and were discarded. The 
plates were stained for 10 min at 24 *C with Coomassie 
blue solution (0.25% Coomassie blue (R-250) 25% 
ethanol) and destained with 25% ethanol, 10% acetic 
acid for 20 min. Zones of proteolysis appeared as 
blue halos on a white background on the underside of 
the plate and were compared to the original growth 
plate that was similarly stained and destained as a 
control. Clones were considered positive that 
produced proportionately larger zones of proteolysis 
on the high pH plates relative to the original growth 
plate. Negative clones gave smaller halos under 
alkaline conditions. Positive and negative clones 
were restreaked to colony purify and screened again in 
triplicate to confirm alkaline pH results. 
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D. Identification and Analysis 0251446 
of Mutant Subtilisins 

Plasmid DNA from 5 ml overnight cultures of more 
alkaline active B.subtilis clones was prepared 
according to Birnboim and Doly (Birnboim, B.C., et al. 

(1979) Nucleic Acid Res. l_ t 1513) except that 
^ incubation with 2 mg/ml lysozyme proceeded for 5 min 

at 37°C to ensure cell lysis and an additional 
phenol/CHCl^ extraction was employed to remove 
contaminants. The 1.5 kb EcoRI-BamHI fragment 
containing the subtilisin gene was ligated into 
10 M13mpll and template DNA was prepared for DNA 
sequencing (Messing, J., et al. (1982) Gene , 19 
269-276) . Three DNA sequencing primers ending at codon 
26, +95, and +155 were synthesized to match the 
subtilisin coding sequence. For preliminary sequence 
identification a single track of DNA sequence, 
corresponding to the dNTPaS misincorporation library, 
from which the mutant came, was applied over the 
entire mature protein coding sequence (i.e., a single 
dideoxyguanosine sequence track was applied to 
identify a mutant from the dGTPas library) . A 
complete four track of DNA sequence was performed 200 
bp over the site of mutagenesis to confirm and 
identify the mutant sequence (Sanger, F. , et al., 

(1980) J. Mol. Biol. , 143, 161-178). Confirmed 
25 positive and negative bacilli clones were cultured in 

LB media containing 12.5pg/mL cmp and purified from 
culture supernatants as previously described (Estell, 
D.A., et al. (1985) J. Biol. Chem. , 260 , 6518-6521). 
Enzymes were greater than 98% pure as analyzed by 
SDS-polyacrylamide gel electrophoresis (Laemmli, U.K. 
(1970), Nature , 227 , 680-685), and protein 
concentrations were calculated from the absorbance at 

280 nm, e280 % = 1#17 < Maturbara » H. , et al. (1965), 
Biol. Chem , 240, 1125-1130). 



20 



30 



114- 



0251 



Enzyme activity was measured with 200 P g/mL 
succinyl-L-AlaL-AlaL-ProL-Phep-nitroanilide (Sigma) in 
0.1M Tris pH 8.6 or 0.1 M CAPS pH 10.8 at 25*C. 
Specific activity (p moles product/min-mg) was 
calculated from the change in absorbance at 410 nm 
from production of p-nitroaniline with time per mg of 
enzyme (E410 - 8,480 M-lcm-1; Del Mar, E.G., et al. 
(1979), frnal , Biochem. . 29, 316-320). Alkaline 
autolytic stability studies were performed on purified 
enzymes (200/ig/mL) in 0.1 M potassium phosphate (pH 
12.0) at 37 'C. At various times aliguots were assayed 
for residual enzyme activity (Wells, J. A., et al. 
(1986) J. Biol. Chem. , 261, 6564-6570) . 



E. Results 

1. Optimization and analysis 
of mutagenesis frequency 

A set of primer-template molecules that were randomly 
3» -terminated over the subtilisin gene (Fig. 31) was 
produced by variable extension from a fixed 5 '-primer 
(The primer mutated a unique Aval site at codon 11 in 
the subtilisin gene) . This was achieved by stopping 
polymerase reactions with EDTA after various times of 
extension. The extent and distribution of duplex 
formation over the 1 kb subtilisin gene fragment was 
assessed by multiple restriction digestion (not 
shown) . For example, production of new Hinf I 
fragments identified when polymerase extension had 
proceeded past IlellO, Leu233, and Asp259 in the 
subtilisin gene. 

Misincorporation of each dNTPas at randomly terminated 
3 1 ends by AMV reverse transcriptase (Zakour, R.A., et 
al. (1982), Nature , 295 , 708-710; Zakour, R.A., et al. 
(1984), Nucleic Acids Res. . 12, 6615-6628). used 
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conditions previously described (Champoux, J.J., 
(1984), Genetics r g, 454-464). The efficiency of each 
misincorporation reaction was estimated to be greater 
than 80% by the addition of each dNTPos to the Aval 
restriction primer, and analysis by polyacrylamide gel 
5 electrophoresis, Misincorporations were sealed by 
polymerization with all four dNTP^ and closed 
circular DNA was produced by reaction with DNA ligase. 

Several manipulations were employed to maximize the 
10 yield of the mutant sequences in the heteroduplex. 
These included the use of a deoxyuridihe containing 
template (Kunkel, T.A. (1985), Proc. Natl, Acad. Sci. 
USA . 82 488-492; Pukkila, P.J . et al. (1983), 
Genetics . 104 , ~ 571-582) , in vitro methylation of the 
15 mutagenic strand (Kramer, W. et al. (1982) Nucleic 
Acids Res. . 10 6475-6485) , and the use of Ava l 
restriction-selection against the wild-type template 
strand which contained a unique Ava l site. The 
separate contribution of each of these enrichment 
20 procedures to the final mutagenesis frequency was not 
determined, except that prior to Ava l restriction- 
selection roughly one-third of the segregated clones 
in each of the four pools still retained a wild-type 
Ava l site within the subtilisin gene. After Ava l 
25 restriction-selection greater than 98% of the plasmids 
lacked the wild-type Ava l site. 

The 1.5 kb EcoRI-BamHI subtilisin gene fragment that 
was resistant to Ava l restriction digestion, from each 

30 of the four CsCl purified M13 RF pools was isolated on 
low melting agarose. The fragment was ligated in situ 
from the agarose with a similarly cut £. coli -B. 
subtilis shuttle vector, pB0180, and transformed 
directly into E coli LE392. Such direct ligation and 

35 transformation of DNA isolated from agarose avoided 
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loses and allowed large numbers of recombinants to ^e 
obtained (>100,000 per pg equivalent of input M13 
pool) . 

The frequency of mutagenesis for each of the four 
dNTPas misincorporation reactions was estimated from 
5 the frequency that unique restriction sites were 
eliminated (Table XX) . The unique restriction sites 
chosen for this analysis, Clal, PvuII, and K£nl, were 
distributed over the subtilisin gene starting at 
codons 35, 104, and 166, respectively. As a control, 
10 the mutagenesis frequency was determined at the PstI 
site located in the B lactamase gene which was outside 
the window of mutagenesis. Because the absolute 
mutagenesis frequency was close to the. percentage of 
undigested plasmid DNA, two rounds of restriction- 
15 selection were necessary to reduce the background of 
surviving uncut wild-type plasmid DNA below the mutant 
plasmid (Table XX). The background of surviving 
plasmid from wild-type DNA probably represents the sum 
total of spontaneous mutations, uncut wild-type 
20 plasmid, plus the efficiency with which linear DNA can 
transform E. coli. Subtracting the frequency for 
unmutagenized DNA -(background) from the frequency for 
mutant DNA, and normalizing for the window of 
mutagenesis sampled by a given restriction analysis 
25 (4-6 bp) provides an estimate of the mutagenesis 
efficiency over the entire coding sequence (-1000 bp). 
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10 



a-thiol 
dNTP 

misincor- 
■ (b) 



Restriction 
Site 



% resistant clones 
1st 2nd 



% resistant 
clones over 



% 

mutants 
per 



porated Selection round round Total Background lOOObp 



None 


PstI 


0.32 


0.7 


0.002 


0 




G 


PstI 


0.33 


1.0 


0.003 


0.001 


0.2 


T 


PstI 


0.32 


<0.5 


<0.002 


0 


0 


C 


PstI 


0.43 


3.0 


0.013 


0.011 


3 


None 


Clal 


0.28 


5 


0.014 


0 




G 


Clal 


2.26 


85 


1.92 


1.91 


380 


T 


Clal 


0.48 


31 


0.15 


0.14 


35 


C 


Clal 


0.55 


15 


0.08 


0.066 


17 



15 


None 


PvuII - 


0.08 


29 


0.023 


0 






G 


PvuII 


0.41 


90 


0.37 


0.35 


88 




T 


PvuII 


0.10 


67 


0.067 


0.044 


9 




C 


PvuII 


0.76 


53 


0.40 


0.38 


95 



20 


None 


Kpnl 


0.41 


3 


0.012 


0 






G 


Kpnl 


0.98 


35 


0.34 


0.33 


83 




T 


Kpnl 


0.36 


15 


0.054 


0.042 


8 




C • 


Kpnl 


- 1.47 


26 


0.38 


0.37 


93 



25 

Mutagenesis frequency is estimated from the 
frequency for obtaining mutations that alter unique 
restriction sites within the mutagenized subtilisin 
gene (i.e., Cla l, Pvu II, or Kpn l) compared to mutation 
frequencies of the Pst I site, that is outside the 
window of mutagenesis. 

30 (b) 

Plasmid DNA was from wild- type (none) or 
mutagenized by dNTPas misincorporation as described. 

(c) 

Percentage of resistant clones was calculated 
from the fraction of clones obtained after three fold 
or greater over-digestion of the plasmid with the 
indicated restriction enzyme compared to a 

35 
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non-digested control . Restriction-resistant plasmid 
DNA from the first round was subjected to a second 
round of restriction-selection. The total represents 
the product of the fractions of resistant clones 
obtained from both rounds of selection and ^ gives 
percentage of restriction-site mutant clones in the 
original starting pool. Frequencies were derived from 
counting at least 20 colonies and usually greater than 
5 100. 

^ Percent resistant clones was calculated by 
subtracting the percentage of restriction-resistant 
clones obtained for wild-type DNA (i.e., none) from 
that obtained for mutant DNA. 

10 * e) This extrapolates from the frequency of mutation 
over each restriction site to the entire subtilisin 
gene (-1 kb) . This has been normalized to the number 
of possible bases (4-6 bp) within each restriction 
site that can be mutagenized by a given 
misincorporation event. 
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From this analysis, the average percentage of 
subtilisin genes containing mutations that result from 
dGTPas, dCTPas, or dTTPas misincorporation was 
estimated to be 90, 70, and 20 percent, respectively. 
These high mutagenesis frequencies were generally 
quite variable depending upon the dNTPas and 
misincorporation efficiencies at this site. 
Misincorporation efficiency has been reported to be 
both dependent on the kind of mismatch, and the 
context of primer (Champoux, J.J., (1984); Skinner, 
J. A., et al. (1986) Nucleic Acids Res. , 14 , 
6945-6964). Biased misincorporation efficiency of 
dGTPas and dCTPas over dTTPas has been previously 
observed (Shortle, D., et al. (1985), Genetics ', 110 , 
539-555). Unlike the dGTPas, dCTPas, and dTTPas 
libraries the efficiency of mutagenesis for the dATPas 
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misincorporation library could not be accurately 
assessed because 90% of the restriction-resistant 
plasmids analyzed simply lacked the' subtilisin gene 
insert. This problem probably arose from 
self-ligation of the vector when the dATPas 
mutagenized subtilisin gene was subcloned from M13 
into pB0180. Correcting for the vector background, we 
estimate the mutagenesis frequency around 20 percent 
in the dATPas misincorporation library. In a separate 
experiment (not shown) , the mutagenesis efficiencies 
for dGTPas and dTTPas misincorporation were estimated 
to be around 50 and 30 percent, respectively, based on 
the frequency of reversion of an inactivating mutation 
at codon 169. 

The location and identity of each mutation was 
determined by a single track of DNA sequencing 
corresponding to the misincorporated athiodeoxy- 
nucleotide over the entire gene followed by a complete 
four track of DNA sequencing focused over the site of 
mutation. Of 14 mutants identified, the distribution 
was similar to that reported by Shortle and Lin (1985) 
except we did not observe nucleotide insertion or 
deletion mutations. The proportion of AG mutations 
was highest in the G misincorporation library, and 
some unexpected point mutations appeared in the dTTPas 
and dCTPas libraries. 



2. Screening and Identification of 

Alkaline Stability Mutants of Subtilisin 

30 

It is possible to screen colonies producing subtilisin 
by halos of casein digestion (Wells, J. A. et al. 
(1983) Nucleic Acids Res. , 11, 7911-7925) • However, 
two problems were posed by screening colonies under 
high alkaline conditions (>pH 11). First, B. subtilis 
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will not grow at high pH, and we have been unable to 
transform an alkylophilic strain of bacillus. This 
problem was overcome by adopting a replica plating 
strategy in which colonies were grown on filters at 
neutral pH to produce subtilisin and filters 

5 subsequently transferred to casein plates at pH 11.5 
to assay subtilisin activity. However, at pH 11,5 the 
casein micells no longer formed a turbid background 
and thus prevented a clear observation of proteolysis 
halos. The problem was overcome by briefly staining 

10 the plate with Coomassie blue to amplify proteolysis 
zones and acidifying the plates to develop casein 
micell turbidity. By comparison of the halo size 
produced on the reference growth plate (pH 7) to the 
high pH plate (pH 11.5), it was possible to identify 

15 mutant subtilisins that had increased (positives) or 
decreased (negatives) stability under alkaline 
conditions. 

Roughly 1000 colonies were screened from each of the 
20 four mis incorporation libraries. The percentage of 
colonies showing a differential loss of activity at pH 
11*5 versus pH 7 represented 1.4, 1.8, 1.4, and 0.6% 
of the total colonies screened from the thiol dGTPas, 
dATPas, dTTPas, and dCTPas libraries, respectively. 
25 Several of these negative clones were sequenced and 
all were found to contain a single base change as 
expected from the misincorporation library from which 
they came. Negative mutants included A3 6. E170 and 
V50. Two positive mutants were identified as V107 and 
30 R213. The ratio of negatives to positives vas roughly 
50:1. 



35 



-121- 



0251446 



3. Stability and Activity of 

Subtilisin Mutants at Alkaline p H 

Subtilisin mutants were purified and their autolytic 
stabilities were measured by the time course of 
inactivation at pH 12.0 (Figs. 32 and 33). Positive 
mutants identified from the screen (i.e., V107 and 
R213) were more resistant to alkaline induced 
autolytic inactivation compared to wild-type; negative 
mutants (i.e., E170 and V50) were less resistant. We 
had advantageously produced another mutant at position 
50 (F50) by site-directed mutagenesis. This mutant 
was mor.e stable than wild-type enzyme to alkaline 
autolytic inactivation (Fig. 33) At the termination 
of the autolysis study, SDS-PAGE analysis confirmed 
that each subtilisin variant had autolyzed to an 
extent consistent with the remaining enzyme activity. 

The stabilizing effects of V107, R213, and F50 are 
cumulative. See Table XXI, The double mutant, 
V107/R213 (made by subcloning the 920 bp EcoRI-Kgnl 
fragment of pB0180V107 into the 6.6 kb JEcoRI-Kpnl 
fragment of pB0180R213), is more stable than either 
single mutant. - The triple mutant, F50/V107/R213 (made 
by subcloning the 735 bp EcoRI-Pvuir fragment of pF50 
(Example 2) into the 6.8 kb EcoRI-PvuII fragment of 
PB0180/V107, is more stable than the double mutant 
V107/R213 or F50. The inactivation curves show a 
biphasic character that becomes more pronounced the 
more stable the mutant analyzed. This may result from 
some destablizing chemical modification (s) (eg., 
deamidation) during "the autolysis study and/or reduced 
stabilization caused by complete digestion of larger* 
autolysis peptides. These alkaline autolysis studies 
have been repeated on separately purified enzyme 
batches with essentially the same results. Rates of 
autolysis should depend both on the conformational 



~ 124 ~ 0251446 

F. Random Cassette Mutagenesis 
of Residues 197 through 228 

Plasmid pA222 (Wells, et al. (1985) Gene 34, 315-323) 
was digested with PstI and BamH l and the 0.4 kb 
Pst l/ Bam HI fragment (fragment 1, see Fig. 34) purified 
from a polyacrylamide gel by electroelution. 



The 1.5 kb Eco RI/ BamH I fragment from pS4.5 was cloned 
into M13mp9. Site directed mutagenesis was performed 
to create the A197 mutant and simultaneously insert a 
silent Sst I site over codons 195-196. The mutant 
EcoR I/ Bam HI fragment was cloned back into pBS42. The 
pA197 plasmid was digested with Bam HI and Sst I and the 
5.3 kb MmHI/ Sst I fragment (fragment 2) was purified 
from low melting agarose. 

Complimentary oligonucleotides were synthesized to 
span the region from Sst I (codons 195-196) to Pst I 
(codons 228-230) • These oligodeoxynucleotides were 
designed to (1) restore codon 197 to the wild type, 
(2) re-create a silent Kpn l site present in pa222 at 
codons 219-220, (3) create a silent Smal site over 
codons 210-211, and (4) eliminate the Pst I site over 
codons 228-230 (see Fig. 35). Oligodeoxynucleotides 
were synthesized with 2% contaminating "nucleotides at 
each cycle of synthesis, e.g., dATP reagent was spiked 
with 2% dCTP, 2% dGTP, and 2% dTTP. For 97-mers, this 
2% poisoning should give the following percentages of 
non-mutant, single mutants and double or higher 
mutants per strand with two or more misincorporations 
per complimentary strand: 14% non-mutant, 28% single 
mutant, and 57% with &2 mutations, according to the 
general formula 

n 

35 f * * — e-p . 

nl 
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where p is the average number of mutations and n is a 
number class of mutations and f is the fraction of the 
total having that number of mutations. Complimentary 
oligodeoxynucleotide pools were phosphorylated and 
annealed (fragment 3) and then ligated at 2-fold molar 
excess over fragments 1 and 2 in a three-way ligation. 

E. coli MM294 was transformed with the ligation 
reaction, the transformation pool - grown up over night 
and the pooled plasmid DNA was isolated. This pool 
represented 3.4 x 10 independent transformants. This 
plasmid pool was digested with Pst I and then used to 
retrahsform E. coli . A second plasmid pool was 
prepared and used to transform B. • 6ubtilis (BG2036) . 
Approximately 40% of the BG2036 transformants actively 
expressed subtil isin as judged by halo-clearing on 
casein plates. Several of the non-expressing 
transformants were sequenced and found to have 
insertions or deletions in the synthetic cassettes. 
Expressing BG2036 mutants were arrayed in microtiter 
dishes with lSOpl of LB/12. 5pg/mL chloramphenicol 
(cmp) per well, incubated at 37 *C for 3-4 hours and 
then stamped in duplicate onto nitrocellulose filters 
laid on LB 1.5% skim milk/5/jg/mL cmp plates and 
incubated overnight at 33 # C (until halos were 
approximately 4-8 mm in diameter). Filters were then 
lifted to stacks of filter paper saturated with 
1 x Tide commercial grade detergent, 50 mM Na 2 C0 3 , pH 
11.5 and incubated at 65*C for 90 min. .Overnight 
growth plates were Cosmassie stained and destained to 
establish basal levels of expression. After this 
treatment, filters were returned to pH7/skim 
milk/20/xg/mL tetracycline plates and incubated at 37 *C 
for 4 hours to overnight. 
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Mutants identified by the high pH stability screen to 
be more alkaline ( stable were purified and analyzed for 
autolytic stability at high pH or high temperature. 
The double mutant C204/R213 was more stable than wild 
type at either high pH or high temperature (Table 
XXII). 

This mutant was dissected into single mutant parents 
(C204 and R213) by cutting at the unique Sma l 
restriction site (Fig. 35) and either ligating wild 
type sequence 3' to the Sma l site to create the single 
C204 mutant or ligating wild type sequence 5' to the 
Smal site to create the single R213 mutant. Of the 
two single parents, C204 was nearly as alkaline stable 
as the parent double mutant (C04/R213) and slightly 
more thermally "stable. See Table XXII. The R213 
mutant ' was only slightly more stable than wild type 
under both conditions (not shown) . 

Another mutant identified from the screen of the 197 
to 228 random cassette mutagenesis .was R204. This 
mutant was more stable than wild type at both high pH 
and high temperature but less stable than C204. 

25 
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TABLE XXII 

Stability of subtilisin variants 

Purified enzymes (200Mg/mL) were incubated in 0.1M 
phosphate, pH 12 at 30* C for alkaline autolysis, or in 
2mM CaCl 2 , 50mM MOPS, pH 7.0 at 62 *C for thermal 
autolysis. At various times samples were assayed for 
residual enzyme activity. Inactivations were roughly 
pseudo-first order, and t 1/2 gives the time it took 
to reach 50% of the starting activity in two separate 
experiments . 



t 1/2 t 1/2 

25 (alkaline (thermal 

autolysis) autolysis) 



10 



20 



25 



Subtilisin variant 


Exp. 
#1' 


Exp. 

n 


Exp. 
#1 


Exp. 
#2 


wild type 


30 


25 


20 


23 


F50/V107/R213 


49 


41 


18 


23 


R204 


35 


32 


24 


27 


C204 


43 


46 


38 


40 


C204/R213 


50 


52 


32 


36 


L204/R213 


32 


30 


20 


21 



G. Random Mutagenesis at Codon 204 

30 Based on the above results, codon 204 was targeted for 
random mutagenesis. Mutagenic DNA cassettes (for 
codon at 204) all contained a fixed R213 mutation 
which was found to slightly augment the stability of 
the C204 mutant. 
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Plasmid DNA encoding the subtilisin mutant C204/R213 
was digested with Sst I and EcoRI and a 1.0 kb 
EcoRI/SstI fragment was isolated by electro-elution 
from polyacrylamide gel (fragment 1, see Fig. 35). 

5 

C204/R213 was also digested with Sroa l and Eco RI and 
the large 4.7 kb fragment, including vector sequences 
and the 3» portion of coding region, was isolated from 
low melting agarose (fragment 2, see Fig. 36). 

10 

Fragments 1 and 2 were combined in four separate 
three-way ligations with heterophosphorylated 
fragments 3 (see Figs. 36 and 37). This hetero- 
phosphorylation of synthetic duplexes should 

15 preferentially drive the phosphorylated strand into 
the plasmid ligation product. Four plasmid pools, 
corresponding to the four ligations, were restricted 
With Sma l in order to linearize any single cut 
C204/R213 present from fragment 2 isolation, thus 

20 reducing the background of C204/R213. E. coli was 
then re-transformed with Smal-restricted plasmid pools 
to yield a second set of plasmid pools which are 
essentially free of C204/R213 and any non-segregated 
heterduplex material. 

25 

These second enriched plasmid pools were then used to 
transform B. subtilis (BG2036) and the resulting four 
mutant pools were screened for clones expressing 
subtilisin resistant to high pH/temperature 
30 inactivation. Mutants found positive by such a screen 
were further characterized and identified by 
sequencing. 

The mutant L204/R213 was found to be slightly more 
35 stable than the wild type subtilisin. See Table XXII. 
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Having described the preferred embodiments of the 
present invention, it will appear to those ordinarily 
skilled in the art that various modifications may be 
made to the disclosed embodiments, and that such 
modifications are intended to be within the scope of 
the present invention. 
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CLAIMS; 

1. A carbonyl hydrolase mutant having at least one 
property which is substantially different from the 
same property of a precursor carbonyl hydrolase from 
which the amino acid sequence of said carbonyl 
hydrolase mutant is derived, said property being 
selected from the group consisting . of thermal 
stability and alkaline stability wherein said 
precursor carbonyl hydrolase is selected from the - 
group consisting of naturally occurring carbonyl 
hydrolases and recombinant carbonyl hydrolases and 
said carbonyl hydrolase mutant amino acid sequence is 
derived by a method selected from the group consisting 
of the substitution, deletion and insertion of at 
least one amino acid in said amino acid sequence of 

15 said precursor carbonyl hydrolase. 

2. A carbonyl hydrolase mutant having at least one 
property which is substantially different from the 
same property of a precursor carbonyl hydrolase from 

20 

which the amino acid sequence of said carbonyl 
hydrolase mutant is derived, said property .being 
selected from the group consisting of oxidative 
stability, substrate specificity, catalytic activity, 
thermal stability, alkaline stability and pH activity 
profile wherein said precursor carbonyl hydrolase is 
selected from the group consisting of naturally 
occurring carbonyl hydrolases and recombinant carbonyl 
hydrolases and said carbonyl hydrolase mutant amino 
acid sequence is derived by a method selected from the 
group consisting of deletion and insertion of at least 
one amino acid in said amino acid sequence of said 
precursor carbonyl hydrolase and substitution of more 
than one amino acid residue of said amino acid 
sequence of said precursor carbonyl hydrolase. 
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3. A carbonyl hydrolase mutant derived by the 
replacement of at least one amino acid residue of a 
precursor carbonyl hydrolase with a different amino 
acid, said one amino acid residue being selected from 

5 the group of amino acid residues of Bacillus 
amyloliquefaciens subtilisin consisting of Tyr21, 
Thr22, .Ser24, Ser33, Asp36, Ala45, Gly46, Ala48, 
Ser49, Met50, Asn77, Ser87, Lys94, Val95, Leu96, 
Tyrl04, Ilel07 f GlyllO, Metl24, Asnl55, Glul56, 

10 Lysl70 f Tyrl71, Prol72 f Phel89, Aspl97, Metl99, 
Ser204, Lys213, Tyr217, Ser221, His67, Leul26, Leul35, 
Gly97, Asp99, SerlOl, Glyl02, Glul03 f Leul26, Glyl27, 
Glyl28, Prol29, Tyr214, Gly215, and equivalent amino 
acid residues in other precursor carbonyl hydrolases, 

15 

4. A carbonyl hydrolase mutant having an amino acid 
sequence derived from the amino acid sequence of a 
precursor carbonyl hydrolase by the substituion of a 
differnt amino acid for more than one amino acid 

20 residue' of said amino acid sequence of said precursor 
carbonyl hydrolase, said amino acid , residues being 
selected from the group of amino acid residues of 
Bacillus amyloliquefaciens subtilisin consisting of 
Tyr21, Thr22, Ser24, Asp32, Ser33, Asp36, Ala45, 

25 Gly46, Ala48, Ser49, Met50, Asn77, Ser87, Lys94, 
Val95, Leu96, Tyrl04, Ilel07, GlyllO, Metl24, Alal52, 
Asn-155, Glul56, Glyl66, Glyl69, Lysl70, Tyrl71, 
Prol72, Phel89, Aspl97, Metl99, Ser204, Lys213, 
Tyr217, Ser221, Met222, His67, Leul26, Leul35, Gly97, . 

30 Asp99, SerlOl, Glyl02, Glul03, Leul26, Glyl27, Glyl28, 
Prol29, Tyr214, Gly215, and equivalent amino acid 
residues in other precursor carbonyl hydrolases. 
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5. The mutant of Claim 4 wherein said combinations 
are selected from the group consisting of Thr22/Ser87, 
Ser24/Ser87, Ala45/Ala48, Ser49/Lys94, Ser49/Val95, 
Met50/Val95, Met50/Glyll0, Met50/Metl24 , Met50/Met222, 
Metl24/Met222, Glul56/Glyl66 , Glul56/Glyl69, 
Glyl66/Met222, Glyl69/Met222 / Tyr21/Thr22, 
Met50/Metl24/Met222, Tyr21/Thr22/Ser87, Met50/Glul56/ 
Glyl66/Tyr217, Met50/Glul56/Tyr217 , Glul56/Glyl69/ 
Tyr217, Ilel70/Lys213 , Ser204/Lys213, Met50/Ilel07/ 
Lys213 and Ser24/Met50/Ilel07/Glul56/Glyl66/Glyl69/ 
Ser204/Lys213/Gly215/Tyr217. 

6. A carbonyl hydrolase mutant derived by the 
replacement of at least one amino acid residue of a 
precursor carbonyl hydrolase with a different amino 
acid, said one amino acid residue being selected from 
the group of amino acid residues of of Bacillus 
amy lo lique f ac iens subtilisin consisting of Tyr21, 
Thr22, Ser24, Asp32, Ser33, Asp36, Ala45, Gly46, 
Ala48, Ser49, MetSO, Asn77, Ser87, Lys94, Val95 f 
Leu96, Tyrl04, Ilel07, GlyllO, Metl24 f Alal52, 
Asn-155, Glul56, . Glyl66, Glyl69 f Lysl70 f Tyrl71, 
Prol72, Phel89, Aspl97, Metl99, Ser204, Lys213, 
Tyr217, Ser221, Met222 f His67, Leul26, Leul35, Gly97, 
Asp99, SerlOl, Glyl02, Glul03, Leul26 f Glyl27, Glyl28, 
Prol29, Tyr214, Gly215, and equivalent amino acid 
residues in other precursor carbonyl hydrolases, 
wherein said at least one amino acid residue of said 
precursor carbonyl hydrolase is replaced with the 
amino acid residues listed in TABLE I and TABLE II 
herein. 

7. The mutant of Claim 6 wherein the amino acid 
replacing said at least one amino acid residue in said 
precursor carbonyl hydrolase is selectd from the 
replacement amino acids listed in TABLE I herein. 
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8. Mutant DNA sequence encoding the mutant of claims 
1 through 7. 



9. Expression vector containing the mutant DNA 
sequence of claim 8. 

5 

10. Host cell .transformed with the expression vector 
of Claim 9. 

10 
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£«?RI Clo\ PvuW BomHl 



bo 



P RBS ^PRE^PRO MAT TERM 

I.5kb -H 



RBS 



.1*7 
fwet 

1 oGTCTACTAAAATATTATTCCA TACTATA CAATTMTACACAGAATAATCTGTCTAT T6GTTATTCTGCAAATGAAAAAAAK GAGAGG ATAAAGA GTG 

-inn PRE .on 

Arq Gly Lys Lys Val Trp Me Ser leu Leu Phe Ala Leu Ala Leu lie Phe Thr Met Ala Phe Gly Ser Thr Ser 
99 AGA GGC AAA AAA GTA TGG ATC AGT TTG CTG TTT GCT TTA GCG TTA ATC TTT ACG ATG GCG TTC GGC AGC ACA TCC 

.80 -70 PRO ■ -60 

Ser Ala Gin Ala Ala Gly Lys Ser Asn Glv Glu Lvs Lys Tyr lie Val Gly Phf Lys Gin Thr Ket Ser Thr Het 
174 TCT 6CC CA6 GCG GCA GGG AAA TCA AAC GGG GAA AAG AAA TAT ATT GTC GGG TTT AAA CAG ACA ATG AGC ACG ATG 

-50 -40 
Ser Ala Ala Lys Lys Lys Asp Val lie Ser Glu Lys Gly Gly Lys Val Gin Lvs Gin Phe Lvs Tvr Val As/i Ala 
?49 AGC GCC GCT AAG AAG AAA GAT GTC ATT TCT GAA AAA GGC GGG AAA GTG CAA AAG CAA TTC AAA TAT GTA GAC GCA 

-30 -10 
Ala Ser Ala Thr Leu Asn Glu Lys Ala Val Lys Glu Leu Lvs Lys Asp Pro Ser Val Ala Tvr Val Glu Glu *sd 
324 GCT TCA GCT ACA TTA AAC GAA AAA GCT GTA AAA GAA TTG AAA AAA GAC CCG AGC GTC GCT TAC GTT GAA GAA GAT 

-i IT- MAT 

His Val Ala His Ala Tyr Ala Gin Ser Val Pro Tyr Gly Val Ser Gin Fie Lys Ala Pro Ala Leu His Ser Gin 
399 CAC GTA GCA CAT GCG TAC GCG CAG TCC GTG CCT TAC GGC GTA TCA CAA ATT AAA GCC CCT GCT CTG CAC TCT CAA 

20 30 40 

Gly Tyr Thr Gly Ser Asn Val lys Val Ala Val f If Asp Ser Gly He Asp Ser Ser His Pro Asn Leu lvs Val 
474 GGC TAC ACT GGA TCA AAT GTT AAA GTA GCG GTT ATC GAC AGC GGT ATC GAT TCT TCT CAT CCT GAT TTA AAG G1A 

50 Pro Asn 60 Asp 

Ala Gly Gly Ala Ser Ket Val Pro Ser Glu Thr Asn Pro Phe Gin Asp Asn Asn Ser His Glv Thr His Val Ala 
549 GCA GGC GGA GCC AGC ATG GTT CCT TCT GAA ACA AAT CCT TTC CAA GAC AAC AAC TCT CAC GGA ACT CAC GTT GCC 

70 BO Ser Ala 90 

Gly Thr Val Ala Ala Leu Asn Asn Ser Me Gly Val Leu Gly Val Ala Pro Ser Ala Ser Leu Tvr Ala Val Lvs 
6?4 GGC ACA GTT GCG GCT CTT AAT AAC TCA ATC GGT GTA TTA GGC GTT GCG CCA AGC GCA TCA CTT TAC GCT GTA AAA 

Asp Ala 100 110 
Val Leu Gly Ala Asp Gly Ser Gly.61n Tyr Ser Trp He He Asn Gly lie Glu Trp Ala lie Ala Asn Asn *et 
699 GTT CTC GGT GCT GAC GGT TCC GGC CAA TAC AGC TGG ATC ATT AAC GGA ATC GAG TGG GCG ATC GCA AAf AAT ATG 



120 130 140 

Asp Val lie Asn Met Ser Leu Glv Gly Pro Ser Glv Ser Ala Ala Leu Lvs Ala Ala V*1 a sn (vs Ala v*l Ma 
774 GAC GTT ATT AAC ATG AGC CTC GGC GGA CCT TCT GGT TCT GCT GCT TTA AAA GCG GCA GTT GAT AAA GCC Git GC* 

150 Ser Thr 160 

Ser Gly Val Val Val Val Ala Ala Ala Gly Asn Glu Gly Thr Ser Gly Srr Ser Ser Thr Val Gly Tvr Pro Glv 
849 TCC GGC GTC GTA GTC GTT GCG GCA GCC GGT AAC GAA GGC ACT TCC GGC AGC TCA AGC AC* GTG GGC TAC f CT GGT 

170 180 190 

Lvs Tyr Pro Ser Val lie Ala Val Glv Ala Val Asp Ser Ser Asn Gin Aro Ala Ser Phe Ser Ser Val Gly Pro 
924 AAA TAC CCT TCT GTC ATT GCA GTA GGC GCT GTT GAC AGC AGC AAC CAA AG A GCA TCT TTC TCA AGC GTA GGA CCT 

200 210 
Glu Leu Asp Val Met Ala Pro Gly Val Ser fie Gin Ser Thr Leu Pro Gly Asn Lvs Tvr Gly Ala Tvr ijn Glv 
999 GAG CTT GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA AGC ACG CTT CCT GGA AAC AAA TAC GGG GCG TAC AAC GGT 

??0 230 ?40 

Thr Ser Met Ala Ser Pro His Val Ala Gly Ala Ala Ala Leu He Leu Ser Lvs His Pro Asn Trp Thr Asn Thr 
1074 ACG TCA ATG GCA TCT CCG CAC GTT GCC GGA GCG GCT GCT TTG ATT CTT TCT AAG CAC CCG AAC TGG ACA AAC ACT 

250 Gin 260 
Gin Val Arq Ser Ser Leu Glu Asn Thr Thr Thr Lvs leu Gly Asp Ser Phe Tyr Tyr Glv lvs Glv Leu He Asn 
1149 CAA GTC CGC AGC AGT TTA GAA AAC ACC ACT ACA AAA CTT GGT GAT TCT TTC TAC TA1 GGA AAA GGG CTG ATC AAC 

VM Gin Ala Ala Ala Gin OC TERM 
1224 GTA CAG GCC GCA GCT CAG TAA AAC ATAAAAAACCGGC CTTGGCCCC GCCGGTTTTTTATT A TTTTT CTTCCTCCGCATGTTCAATCCGCTCC 

1316 ATAATCGACGGATCGCTCCCTCTGAAAATnTAACGAGAAACGGCGGCTTGACCCGGCTCAGTCCCGTAACGGCCAAGTCCTGAAACGTCTCAATCGCCG 

1 41 6 CnCCCGGTTTCCGGTCAGCTCAATGCCGTAACGGTCGGCGGCGTTTTCCTGATACCGCGAGACGCCATTCCTAATCGGATC FIG.— I 
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Homology of BeclHus proteases 

1. Bacillus enylollQUifeclens 
2:B0clllu9 tubtllls vtr.IIBB 
3. Bacillus lichenlfornis (earlsberejensis ) 
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W.T A.A.: 


195 
Glu 


Leu 


200 

Asp Val Met Ala Pro Gly Val Ser 


lie 


206 
Gin 


W.T. DNA; 


GAG 
CTC 


CTT 
GAA 


GAT 
CTA 


GTC 
CAG 


ATG 
TAC 


GCA 
CGT 


CCT 
GGA 


GGC 
CCG 


GTA 
CAT 


TCT 
AGA 


ATC 
TAG 


CAA 
GTT 


pA222DNA: 


GAG 
CTC 


CTT 
GAA 


GAT 
CTA 


GTC 
CAG 


ATG 
TAC 


GCA 
CGT 


CCT 
GGA 


GGC 
CCG 


GTA 
CAT 


TCT 
AGA 


ATC 
TAG 


CAA 
GTT 


A197 DNA: 
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* * 
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CAT 


TCT 
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ATC 
TAG 
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GTT 



Sstl 

Frngmetti from GAG-CT 
pA222mdA197 Cp 
cul w/ Pul* Sstl: 



pA222, A 197 GAG CTC GAT GTC ATG GCA CCT GGC GTA TCT ATC CAA 
cutAligEled C TC GAG CTR CAS TAC CGT GGA CCG CAT AGA TAG GTT 
w/ oligodeoxy- Sstl 
aicicotidc pools: 

207 210 218 

W.TA.A.: ser Thr Leu Pro Gly Asn Lys Tyr Gly Ala Tyr Asn 

wt nuA- ACG CTT CCT GGA Mc TAC GGG GCG TAC wc 

w.i.uixa- TCG TGC GAA GGA CCT TTG TT;p ATG ccc CGC ATG TTG 

pA222DNA: AGC ACG' CTT CCT GGA AAC AAA TAC GGG GCG TAC AAC 
TCG TGC GAA GGA CCT TTG TTT ATG CCC CGC ATG TTG 

A197DNA: AGC ACG CTT CCT GGA AAC AAA TAC GGG GCG TAC AAC 
TCG TGC GAA GGA CCT TTG TTT ATG CCC CGC ATG TTG 

FruiQcxAi from * * 

DA2£2«xi A197 AGC ACG CTT ccc GGG AAC MA TAC GGG GCG TAC Mc 
MwlPstl Sstl: TCG TGC GA A GGG CCC TTG TTT ATG CCC CGC ATG TTG 

Smal 
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AGT- 
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CGC- 


■5' 



W.T A.A.: 
W.T. DNA: 

P&222DNA: 

A197 DNA: 



PGGA GCG-3' 

I>A7 * 1 A ^ GGT ACC TCA ATG GCA TCT CCG CAC GTT GCA GGA GCG-3 " 
CU&llgtied CCA TGG AGT TAP CRT AGA GGC GTG CAA CGT CCT CGC-5" 
w/ol.godcoxy- K nl frfldeaiuyed 

cucleoude pools: r ' 

Oligodeoxynudeotide pools synthesized with 2% contaminating nucleotides in cadi cycle lo give 

-15% of pool with 0 muuiions, -28% of pool with single mutations, and 

-57% of pool with 2 or more mutations, according to tlvc general formula f e-^j-e"* 1 . 
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Fragments 3 

Heterophosphorylated 
duplexes 

(see Fig. 4) 



1, Transform E. coli 



2. Digest 4 plasmid pools with Smal, 
retransfor m E. coli 

3. Transform B. subtilis (BG203R) 
with 4 second pools 

4. Screen 



FIG.— 36 
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